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Translator’s  Note 


This  book  is  a  translation  of  A.  N.  Kolmogorov  and  S.  V.  Fomin’s  book 
“Elementy  Teorii  Funktsii  i  Funktsional’nogo  Analiza.,  II.  Mera,  Inte¬ 
gral  Lebega  i  Prostranstvo  Hilberta”  (1960). 

An  English  translation  of  the  first  part  of  this  work  was  prepared  by 
Leo  F.  Boron  and  published  by  Graylock  Press  in  1957  and  is  mentioned 
in  [A]  of  the  suggested  reading  matter  added  at  the  end  of  the  present 
book. 

The  approach  adopted  by  the  Russian  authors  should  be  of  great 
interest  to  many  students  since  the  concept  of  a  semiring  is  introduced 
early  on  in  the  book  and  is  made  to  play  a  fundamental  role  in  the  subse¬ 
quent  development  of  the  notions  of  measure  and  integral.  Of  particular 
value  to  the  student  is  the  initial  chapter  in  which  all  the  ideas  of  meas¬ 
ure  are  introduced  in  a  geometrical  way  in  terms  of  simple  rectangles  in 
the  unit  square.  Subsequently  the  concept  of  measure  is  introduced  in 
complete  generality,  but  frequent  back  references  to  the  simpler  intro¬ 
duction  do  much  to  clarify  the  more  sophisticated  treatment  of  later 
chapters. 

A  number  of  errors  and  inadequacies  of  treatment  noted  by  the 
Russian  authors  in  their  first  volume  are  listed  at  the  back  of  their  second 
book  and  have  been  incorporated  into  our  translation.  The  only  change 
we  have  made  in  this  addenda  is  to  re-reference  it  in  terms  of  the  English 
Translation  [A]. 

In  this  edition  the  chapters  and  sections  have  been  renumbered  to  make 
them  independent  of  the  numbering  of  the  first  part  of  the  book  and  to 
emphasize  the  self-contained  character  of  the  work. 

New  York,  January  1961 

Natascha  Artin  Brunswick 
Alan  Jeffrey 


Foreword 


This  publication  is  the  second  book,  of  the  “Elements  of  the  Theory  of 
Functions  and  Functional  Analysis,”  the  first  book  of  which  (“Metric 
and  Normed  Spaces”)  appeared  in  1954.  In  this  second  book  the  main 
role  is  played  by  measure  theory  and  the  Lebesgue  integral.  These  con¬ 
cepts,  in  particular  the  concept  of  measure,  are  discussed  with  a  sufficient 
degree  of  generality;  however,  for  greater  clarity  we  start  with  the 
concept  of  a  Lebesgue  measure  for  plane  sets.  If  the  reader  so  desires  he 
can,  having  read  §1,  proceed  immediately  to  Chapter  II  and  then  to 
the  Lebesgue  integral,  taking  as  the  measure,  with  respect  to  which  the 
integral  is  being  taken,  the  usual  Lebesgue  measure  on  the  line  or  on  the 
plane. 

The  theory  of  measure  and  of  the  Lebesgue  integral  as  set  forth  in  this 
book  is  based  on  lectures  by  A.  N.  Kolmogorov  given  by  him  repeatedly 
in  the  Mechanics-Mathematics  Faculty  of  the  Moscow  State  University. 
The  final  preparation  of  the  text  for  publication  was  carried  out  by  S.  V. 
Fomin. 

The  two  books  correspond  to  the  program  of  the  course  “Analysis  III” 
which  was  given  for  the  mathematics  students  by  A.  N.  Kolmogorov. 

At  the  end  of  this  volume  the  reader  will  find  corrections  pertaining 
to  the  text  of  the  first  volume. 

A.  N.  Kolmogorov 
S.  V.  Fomin 
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CHAPTER  I 


MEASURE  THEORY 


The  concept  of  the  measure  n(A)  of  a  set  A  is  a  natural  gener¬ 
alisation  of  the  following  concepts: 

1)  the  length  1(A)  of  a  segment  A; 

2)  the  area  S(F)  of  a  plane  figure  F; 

3)  the  volume  V(G)  of  a  figure  G  in  space; 

4)  the  increment  <p(6)  —  <p(a)  of  a  non-decreasing  function 
<p(t)  on  the  half  open  line  interval  [a,  b ) ; 

5)  the  integral  of  a  non-negative  function  taken  over  some 
line,  plane  surface  or  space  region,  etc. 

The  concept  of  the  measure  of  a  set,  which  originated  in  the 
theory  of  functions  of  a  real  variable,  has  subsequently  found 
applications  in  probability  theory,  the  theory  of  dynamic  systems, 
functional  analysis  and  other  fields  of  mathematics. 

In  the  first  section  of  this  chapter  we  shall  introduce  the  notion 
of  measure  for  sets  in  a  plane,  starting  from  the  concept  of  the 
area  of  a  rectangle.  The  general  theory  of  measures  will  be  given 
in  §§3-7.  The  reader  will  however  easily  notice  that  all  arguments 
used  in  §1  are  of  a  general  character  and  can  be  re-phrased  for  the 
abstract  theory  without  essential  changes. 

1 .  Measure  of  Plane  Sets 

Let  us  consider  a  system  ©  of  sets  in  the  (x,  y)  -plane,  each  of 
which  is  given  by  one  of  the  inequalities  of  the  form 

a  <  x  <  b, 
a  <  x  <  b, 
a  <  x  <  b, 
a  <  x  <  b, 
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together  with  one  of  the  inequalities 

c  <  y  <  d, 
c  <  y  <  d, 
c  <  y  <  d, 
c  <  y  <  d, 

where  a,  b,  c  and  d  are  arbitrary  numbers.  The  sets  belonging  to 
this  system  we  shall  call  “rectangles”.  The  closed  rectangle,  given 
by  the  inequalities 

a  <  x  <  b,  c  <  y  <  d 

is  the  usual  rectangle  (including  the  boundary)  if  a  <  b  and 
c  <  d,  or  a  segment  if  a  =  b  and  c  <  d,  or  a  point  if  a  =  b  and 
c  =  d.  The  open  rectangle 

a  <  x  <  b,  c  <  y  <  d 

is,  depending  on  the  relations  among  a,  b,  c  and  d,  respectively,  a 
rectangle  without  boundaries,  or  the  empty  set.  Each  of  the 
rectangles  of  the  other  types  (let  us  call  them  half-open)  is 
either  a  real  rectangle  without  one,  two  or  three  sides,  or  an  inter¬ 
val,  or  a  half-interval,  or,  finally,  an  empty  set. 

We  shall  define  for  each  of  the  rectangles  a  measure  correspond¬ 
ing  to  the  concept  of  area,  well  known  from  elementary  geometry, 
in  the  following  way: 

a)  the  measure  of  an  empty  set  is  equal  to  zero; 

b)  the  measure  of  a  non-empty  rectangle  (closed,  open  or 
half -open)  given  by  the  numbers  a,  b,  c  and  d  is  equal  to 

(6  —  a)  (d  —  c). 

In  this  way,  to  each  rectangle  P  there  corresponds  a  number 
m(P) — the  measure  of  this  rectangle;  moreover  the  following 
conditions  are  obviously  satisfied : 

1)  the  measure  m(P)  takes  on  real  non-negative  values; 

n 

2)  the  measure  is  additive,  i.e.,  if  P  =  U  Pk  and  n  Pk  =  0 
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for  i  k,  then  n 

m{P)  =  £  m(Pk). 

•  Jfc=l 

Our  task  now  is  to  generalise  the  measure  m{P),  which  up  to  now 
has  been  defined  only  for  rectangles,  to  a  wider  class  of  measures, 
preserving  the  properties  1)  and  2). 

The  first  step  in  this  direction  consists  of  generalising  the  con¬ 
cept  of  measure  to  so  called  elementary  sets.  We  shall  call  a  plane 
set  elementary  if  it  can  be  represented,  at  least  in  one  way,  as  a 
union  of  a  finite  number  of  pairwise  non-intersecting  rectangles. 
For  what  follows  we  shall  use  the  following 

Theorem  1.  The  union,  intersection,  difference  and  symmetric 
difference  of  two  elementary  sets  is  also  an  elementary  set. 

Proof.  It  is  clear  that  the  intersection  of  two  rectangles  is  again 
a  rectangle.  Therefore,  if 

A  =  U  Pk  and  B  =  U  Q; 

k  i 

are  two  elementary  sets,  then 

A  n  B  =  U  {Pk  n  Qj) 

k.i 


is  also  an  elementary  set. 

The  difference  of  two  rectangles  is,  as  is  easily  checked,  an  ele¬ 
mentary  set.  Hence,  taking  away  from  the  rectangle  some  ele¬ 
mentary  set  we  again  obtain  an  elementary  set  (as  the  intersection 
of  elementary  sets).  Now  let  A  and  B  be  two  elementary  sets. 
One  can  obviously  find  a  rectangle  P  containing  each  of  them. 
Then 

A  U  B  =  P\1(P\A)  n  (P\B)2 

is  an  elementary  set  by  what  has  been  said  above.  From  this  and 
the  equalities 


A\B  =  A  n  ( P\B ) 
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and 

A  A  B  =  (A  U  B)\(A  n  B) 

it  follows  that  the  difference  and  the  symmetric  difference  of 
elementary  sets  are  also  elementary  sets.  The  theorem  is  thus 
proved. 

Let  us  now  define  the  measure  m'(A)  for  elementary  sets  in 
the  following  way:  if 

4  =  UL, 

where  the  Pk  are  pairwise  non-intersecting  rectangles,  then 
m'(A)  =  D  m(Pk). 

Let  us  show  that  m'(A)  does  not  depend  on  the  way  the  set  A  is 
represented  as  a  sum  of  rectangles.  Let 

A  =  U  Pk  =  U  Q,-, 

k  i 

where  the  Pk  and  the  Q,  are  rectangles  and  Pi  n  Pk  =  0, 
Qi  n  Qk  ~  0  for  i  0  k.  Since  the  intersection  of  two  rectangles  is 
a  rectangle  we  have,  because  the  measures  of  rectangles  are 
additive, 

D m{Pk )  =  m(Pk  D  Q/)  =  Dm ((?,). 

*  k.j  i 

It  is  easy  to  see  that  the  measure  of  elementary  sets  defined  in 
this  way  is  non-negative  and  additive. 

Let  us  establish  the  following  property  of  elementary  measures 
which  will  be  important  in  what  follows. 

Theorem  2.  If  A  is  an  elementary  set  and  {A„}  is  a  finite  or 
countable  system  of  elementary  sets  such  that 

A  C  U  An, 

n 

then 

m'(A)  <  DmV»).  (*) 

n 

Proof.  For  an  arbitrary  e  >  0  and  a  given  A  we  can  obviously 
find  a  closed  elementary  set  A,  which  is  contained  in  A  and 
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satisfies  the  conditions 


m'(i)  >  m'U)  ~ 

(For  this  it  is  sufficient  to  replace  each  of  the  k  rectangles  P{ 
which  form  A  by  the  closed  rectangle  which  is  wholly  contained 
in  it  and  having  an  area  larger  than  m(P.)  —  e/2k+l.) 

Furthermore,  for  each  An  one  can  find  an  open  elementary  set 
An  containing  A„  and  satisfying  the  condition 

m'{An)  <  m'(An)  +  -“fl¬ 
it  is  clear  that 

A  C  U  An- 

From  {An\  we  can  (by  the  Borel-Lebesgue  lemma)  select  a  sys¬ 
tem  Am,  Anv  •  •  •,  A„t  which  covers  A.  Here  moreover,  obviously 

m\A)  <  Ew'dJ 

t=l 

(since  otherwise  A  could  be  covered  by  a  finite  number  of  rec¬ 
tangles  having  an  added  area  which  is  less  than  m' (A),  which  is 
obviously  impossible).  Therefore, 

m'(A)  <  m'(A)  +  ~  <  X)m'(Ini)  +  -  <  J^m'(An)  +  ^ 

^  i=l  Z  n  Z 


<  L  m'(An)  +  L  ^  \  =  L  m'(An )  +  e, 

yielding  ( * ) ,  since  e  is  arbitrary  and  positive. 

The  set  of  elementary  sets  does  not  exhaust  all  those  sets  which 
were  considered  in  elementary  geometry  and  in  classical  analysis. 
Therefore  it  is  natural  to  ask  the  question  how  to  generalise  the 
concept  of  a  measure  to  a  class  of  sets  which  is  wider  than  the 
finite  combinations  of  rectangles  with  sides  parallel  to  the  co¬ 
ordinate  axes,  preserving  its  basic  properties. 
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The  final  solution  of  this  problem  was  given  by  H.  Lebesgue  at 
the  beginning  of  the  twentieth  century. 

In  presenting  the  theory  of  Lebesgue’s  measure  we  will  have 
to  consider  not  only  finite  but  also  infinite  combinations  of  rect¬ 
angles. 

In  order  to  avoid  dealing  with  infinite  values  for  measures  we 
shall  limit  ourselves  to  sets  which  fully  belong  to  the  square 
E  =  {0  <  x  <  1;  0  <  y  <  1). 

On  the  set  of  all  these  sets  we  shall  define  two  functions  n*  (A) 
and  ju*  ( A )  in  the  following  way. 

Definition  1 .  We  shall  call  the  number 

inf 

i  =  U  Pk 

the  outer  measure  n*{A )  of  the  set  A;  the  lower  bound  is  taken 
over  all  possible  coverings  of  the  set  A  by  finite  or  countable 
rectangles. 

Definition  2.  We  call  the  number 


1  -  n*(E\A) 

the  inner  measure  n*(A)  of  the  set  A.  It  is  easy  to  see  that  always 

m*(A)  <  m*(A). 

Indeed,  suppose  that  for  some  A  C  E 


m*(A)  >  n*(A), 
i.e., 

m*(A)  +  n*(E\A)  <  1. 

Then,  by  definition  of  the  exact  lower  bound,  one  can  find  systems 
of  rectangles  {P,-}  and  \Qk\  covering  A  and  E\A,  respectively, 
such  that 


Z ^  m(Qk )  <  1. 


The  union  of  the  systems  {P,}  and  {Q*}  we  shall  denote  by 
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{/?/};  we  obtain 

E  C  U  Rj  and  m(E )  >  ^  m(R ,•) . 

i  3 

which  contradicts  Theorem  2. 


Definition  3.  We  call  a  set  A  measurable  (in  the  sense  of  Lebesgue), 
if 


n*U)  =  M*U). 


The  common  value  n(A )  of  the  outer  and  inner  measure  for  a 
countable  set  A  is  called  its  Lebesgue  measure. 


Let  us  find  the  basic  properties  of  the  Lebesgue  measure  and  of 
countable  sets. 


Theorem  3.  If 


n 


where  A„  is  a  finite  or  countable  system  of  sets,  then 
M*(d)  < 

n 

Proof.  By  the  definition  of  the  outer  measure,  we  can  find  for 
each  An  a  system  of  rectangles  \Pnk },  finite  or  countable,  such 
that  An  c  U  Pnk  and 

k 


Zm(Pnk)  <  i 

k  £ 

where  e  >  0  is  selected  arbitrarily.  Then 

ACUU  Pnk, 

n  k 

and 

H*{A)  <  EEm(Prt)  <  +  e. 

n  k  n 

Since  e  >  0  is  arbitrary,  this  establishes  the  theorem. 

We  have  already  introduced  above  the  concept  of  measure  for 
sets  which  we  called  elementary.  The  theorem  below  shows  that 
for  elementary  sets  Definition  3  leads  to  exactly  the  same  result. 
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Theorem  4.  Elementary  sets  are  measurable,  and  for  them  the 
Lebesgue  measure  coincides  with  the  measure  m'(A)  constructed 
above. 

Proof.  If  A  is  an  elementary  set  and  Ph  P2,  •  •  •,  Pk  are  the  pair¬ 
wise  non-intersecting  rectangles  comprising  it,  then,  by  definition, 

m-'(A)  =  23  m(Pi) . 

*= 1 

Since  the  rectangles  P ,  cover  all  of  A, 

n*(A)  <  23  m(Pi)  =  m'(A). 

i 

But,  if  {Qy}  is  an  arbitrary  finite  or  countable  system  of  rec¬ 
tangles  covering  A,  then,  by  Theorem  2,  m'(A)  <  m(Qj). 
Hence  m'(A)  <  m*( A ).  Therefore,  m'( A)  = 

Since  E\A  is  also  an  elementary  set,  m'(E\A)  =  n*(E\A). 
But 

m'(E\A)  =  1  —  m’(A )  and  fi*(E\A )  =  1  —  n*(A), 
yielding 

m'(A)  =  n*(A). 

Therefore 

m'(A )  =  ix*(A)  =  ih{A)  =  fi(A). 

From  the  result  obtained  we  see  that  Theorem  2  is  a  special 
case  of  Theorem  3. 

Theorem  5.  In  order  that  the  set  A  be  measurable,  it  is  necessary 
and  sufficient  that  the  following  condition  be  satisfied:  for  any 
s  >  0  there  exists  an  elementary  set  B,  such  that, 

H*(A  A  B)  <  e. 

Thus  those  sets  and  only  those  sets  are  measurable  which  can 
be  “approximated  with  an  arbitrary  degree  of  accuracy”  by 
elementary  sets.  For  the  proof  of  Theorem  5  we  shall  need  the 
following 

Lemma.  For  two  arbitrary  sets  A  and  B 

|m*(A)  A  B). 
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Proof  of  the  lemma.  Since 

A  C  B  U  (A  A  B), 

we  have 

n*(A)  <  m*CB)  +  M*U  A  B). 

This  implies  the  lemma  if  n*(A)  >  n*(B).  If  n*(A)  <  n*(B), 
then  the  lemma  follows  from  the  inequality 

n*(B)  <  m*U)  +  A  B), 

which  can  be  established  in  an  analagous  manner. 

Proof  of  Theorem  5.  Sufficiency.  Let  us  assume  that  for  any 
8  >  0  there  exists  an  elementary  set  B,  such  that 

m*(T  AS)  <e. 

Then 

|m*U)  -  =  |M*U)  -  <  «,  (1) 

and  since 


(E\A)  A  (E\B)  =  A  A  B, 
we  have  analogously  that 

\»*(E\A)  -  m'(E\B)\  <  e.  (2) 

From  the  inequalities  (1)  and  (2)  we  have,  taking  into  account 
m'(B)  +  m'(E\B)  =  m'(E)  =  1, 

that 

I m*(A)  +  p*(E\A)  -  1|  <  28, 
and,  since  e  >  0  is  arbitrary, 

M*U)  +  n*(E\A)  =  1, 
i.e.,  the  set  A  is  measurable. 

Necessity.  Let  A  be  measurable,  i.e.,  let 


M*U)  +M*(^)  =  1. 
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Selecting  e  >  0  arbitrarily,  we  shall  seek  coverings 
icUB,  and  E\A  c  U  C» 

n  n 

for  the  sets  A  and  E\A  by  systems  of  rectangles  [Bn]  and  \Cn) 
for  which 

£»(*„)  <  m*(A)  +  -6  and  I>«7„)  <  ?*(E\A)  +  -6. 

n  n 

Since  Z  m(Bn)  <  00 ,  we  can  find  an  iV  such  that 


<  -S. 

n>,V  *5 

Set 

N 

B  =  U  B„. 

n=l 

It  is  clear  that  the  set 

P  =  U  Bn 

n>N 


contains  A  \B,  and  that  the  set 

Q  =  U  (B  n  Cn) 

n 

contains  B\A,  and  therefore  that  A  A  B  c  P  u  Q.  Hence, 
M*(P)  <  Z™(Bn)  <  l 

n>N  O 

Let  us  evaluate  n*(Q).  For  this  let  us  note  that 


(u  u  (u  (C.\P))  =  E, 

and  hence 

Z  m(BH)  +  Z  m'(Cn\B)  >  1. 


(3) 
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But  by  assumption, 

£  m(Bn)  +  £  m(Cn)  <  M*U)  +  »*(E\A)  +  %  =  1  +  %  (4) 

n  n  O  O 

Subtracting  (3)  from  (4)  we  have 

£  m(Cn)  ~  £  m'(C„\B)  =  £m'(C„nfi)  < 

n  n  n  <5 

i.e., 

n*(Q)  <  |- 

Therefore, 

?*{A  A  B)  <  n*(P)  +  n*(Q)  <  e. 

Hence,  if  A  is  measurable,  there  exists  for  any  arbitrary  e  >  0, 
an  elementary  set  B  such  that  fi*(A  A  B)  <  e.  Theorem  5  is 
thus  established. 

Theorem  6.  The  union  and  intersection  of  a  finite  number  of 
measurable  sets  are  measurable  sets. 

Proof.  It  is  clear  that  it  suffices  to  give  the  proof  for  two  sets. 
Let  A i  and  A2  be  measurable  sets.  Then,  for  any  s  >  0,  one  can 
find  elementary  sets  B  i  and  Bt  such  that, 

m*Mi  A  Si)  <  ~  ,  »*(A2  A  Bt)  <  j- 

Since 

( Ai  U  A2)  A  (Bt  U  Bf)  (di  A  Bi )  U  ( A2  A  B2), 

we  have 

n*l(A !  U  At)  A  (Bt  UB,)]<  A  Bt)  +  m*(42  A  Bt)  <  e.  (5) 

u  fi2  is  an  elementary  set;  hence,  by  Theorem  4,  the  set  .4 ,  u  A2 
is  measurable.  But,  just  by  the  definition  of  measurability,  if  A 
is  measurable,  then  E\A  is  also  measurable;  hence  the  fact  that 
the  intersection  of  two  sets  is  measurable  follows  from  the  relation 

4.n42  =  E\l(E\At)  U  (E\A,n 
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Corollary.  The  difference  and  symmetric  difference  of  two  meas¬ 
urable  sets  is  measurable. 

This  follows  from  Theorem  6  and  the  equations 

Ax\A,  =  A,  n  (E\A»),  1,  Al,=  (Ai\A,)  U  (A,\Ai). 

Theorem  7.  If  Ah  A2,  •••,  An  are  pairwise  non-intersecting 
measurable  sets,  then 

M  (0  A*)  =  ^/i(Ai-). 

\k=l  /  *=1 

Proof.  As  in  Theorem  6  it  suffices  to  consider  the  case  n  =  2. 
Let  us  select  an  arbitrary  e  >  0  and  elementary  sets  Ff  and  B% 
such  that 

m*(A,  A  BF)  <  e,  (6) 

n*(A2  A  B2)  <  e.  (7) 

We  set  A  =  Ai  u  A2  and  B  =  B  iU  B2.  The  set  A  is  measurable 
by  Theorem  6.  Since 

' B\  fl  B2  ^  (Ai  A  B\)  U  ( A2  A  B2) , 

we  have 


m'(Bi  n  B2)  <  2e.  (8) 

By  the  lemma  to  Theorem  5,  and  from  (6)  and  (7)  we  have 

\m'(BI)  —  m*(Ai)I  <  e,  (9) 

|m'(&)  -  m*U,)|  <  e.  (10) 

Since  on  the  set  of  elementary  sets  the  measure  is  additive,  we 
obtain  from  (8),  (9)  and  (10) 

m'(B)  =  m'(Bi)  +  m'(&)  -  m'{B ,  n  B2)  >  n*(Af)  +  M*(A2)  -  4s. 

Noting  moreover  that  A  A  B  cz  (Ai  A  Bi)  u  {A2  A  B2)  we 

finally  have 

p*(A)  >  m'(B)  -  m*(A  A  B)  >  m'(B)  -  2e 
>  ix*(Af)  +ix*(A2)  -  6e. 
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Since  6e  can  be  made  arbitrarily  small, 

/**(A)  >  M*(At)  +  m*(A2). 

The  reverse  inequality 

m*(A)  <  ?*(A0  +  »*(A2) 

is  always  true  for  A  =  Ai  u  A 2,  therefore,  we  finally  obtain 
m*(A)  =  m*(A,)  + 

Since  Ai,  A2  and  A  are  measurable,  we  can  replace  /x*  by  ju.  The 
theorem  is  thus  established. 

Theorem  8.  The  union  and  the  intersection  of  a  countable  number 
of  measurable  sets  are  measurable  sets. 

Proof.  Let 

A i,  A2,  •••,  An,  ■  ■  ■ 

co 

be  a  countable  system  of  measurable  sets  and  A  =  U  An.  Let  us 

n— 1  co  n=  1 

set  An  =  A„\U  Ak.  It  is  clear  that  A  =  U  A'n,  where  the  sets  A'n 

A  =  1  n=  1 

are  pairwise  non-intersecting.  By  Theorem  6  and  its  corollary, 
all  the  sets  A'n  are  measurable.  According  to  Theorems  7  and  3, 
for  any  n 

L(4D  =  m(U  A'k)<  m(A), 

Therefore  the  series 

£m(a.o 

n=l 

converges  and  hence  for  any  e  >  0  one  can  find  an  N  such  that 

ai) 

n>N  * 

N 

Since  the  set  C  =  U  A'n  is  measurable  (being  a  union  of  a  finite 
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number  of  measurable  sets),  we  can  find  an  elementary  set  B 
such  that 


(12) 

Since 

A  A  8  C  (C  A  B)  U  (  U  4'), 

(11)  and  (12)  yield 

lx*(A  A  B)  <  s. 

Because  of  Theorem  5  this  implies  that  the  set  A  is  measurable. 

Since  the  complement  of  a  measurable  set  is  itself  measurable, 
the  statement  of  the  theorem  concerning  intersections  follows 
from  the  equality 

fl  An  =  E\U(E\An). 

n  n 

Theorem  8  is  a  generalisation  of  Theorem  6.  The  following 
Theorem  is  a  corresponding  generalisation  of  Theorem  7. 

Theorem  9.  If  [An\  is  a  sequence  of  pairwise  non-intersecting 
measurable  sets,  and  A  =  U  A„,  then 

n 

m(-A)  =  m(-4„). 

n 

Proof.  By  Theorem  7,  for  any  N, 

m(U  An)=  jr^An)  <  n(A). 

\  n= 1  /  n=l 

Taking  the  limit  as  N  — >  °° ,  we  have 

mU)>£m(4„).  (13) 

71=1 

On  the  other  hand,  according  to  Theorem  3, 

v(A)  <  £m(^„).  (14) 

n= 1 

Inequalities  (13)  and  (14)  yield  the  assertion  of  the  Theorem. 
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The  property  of  measures  established  in  Theorem  9  is  called  its 
countable  additivity,  or  a-additivity.  An  immediate  corollary  of 
^-additivity  is  the  following  property  of  measures,  called  con¬ 
tinuity. 

Theorem  10.  If  A,  2  2  —  is  a  sequence  of  measurable  sets, 

contained  in  each  other,  and  A  =  fl  A„,  then 


m(A)  =  lim  m(A„). 

n-*  oo 

It  suffices,  obviously,  to  consider  the  case  A  =  0,  since  the 
general  case  can  be  reduced  to  it  by  replacing  A„  by  An\A.  Then 

Al  =  (Al\A2)  U  (A2\As)  U  •  •  • 

and 


A  n  —  (  A  n\A  n+l)  U  (  A  n+l\  A  „+2)  9  *  '  *  • 


Therefore 


m(A.)  =  Em(A,\A,+1),  (15) 

k= 1 

and 

m(a„)  =  (16) 

k=n 

since  the  series  (15)  converges,  its  remainder  term  (16)  tends  to 
zero  as  n  — »  » .  Thus, 

m(A„)  — >  0  for  n— ►  oo, 

which  was  to  be  shown. 


Corollary.  If  At  C  A2  C  •••  is  an  increasing  sequence  of  meas¬ 
urable  sets,  and 


A  =  U  An, 

n 


then 


n(A)  =  lim  ju(A„). 


n~*co 
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For  the  proof  it  suffices  to  go  over  from  the  sets  An  to  their 
complements  and  to  use  Theorem  10. 

Thus  we  have  generalised  the  concept  of  a  measure  from  ele¬ 
mentary  sets  to  a  wider  class  of  sets,  called  measurable  sets, 
which  are  closed  with  respect  to  the  operations  of  countable 
unions  and  intersections.  The  measure  constructed  is  <r-additive 
on  this  class  of  sets. 

Let  us  make  a  few  final  remarks. 

1.  The  theorems  we  have  derived  allow  us  to  obtain  an  idea  of 
the  set  of  all  Lebesgue  measurable  sets. 

Since  every  open  set  belonging  to  E  can  be  represented  as  a 
union  of  a  finite  or  countable  number  of  open  rectangles,  i.e., 
measurable  sets,  Theorem  8  implies  that  all  open  sets  are  meas¬ 
urable.  Closed  sets  are  complements  of  open  sets  and  consequently 
are  also  measurable.  According  to  Theorem  8  also  all  those  sets 
must  be  measurable  which  can  be  obtained  from  open  or  closed 
sets  by  a  finite  or  countable  number  of  operations  of  countable 
unions  and  intersections.  One  can  show,  however,  that  these  sets 
do  not  exhaust  the  set  of  all  Lebesgue  measurable  sets. 

2.  We  have  considered  above  only  those  plane  sets  which  are 

subsets  of  the  unit  square  E  =  { 0  <  x,  y  <  1 } .  It  is  not  difficult 
to  remove  this  restriction,  e.g.,  by  the  following  method.  Repre¬ 
senting  the  whole  plane  as  a  sum  of  squares  Enm  =  \n  <  x  < 
n  +  1,  m  <  y  <  m  +  1}  ( m ,  n  integers),  we  shall  say  that  the 
plane  set  A  is  measurable  if  its  intersection  Anm  =  A  n  Enm  with 
each  of  these  squares  is  measurable,  and  if  the  series  ^  n(Anm) 
converges.  Here  we  assume  by  definition  that  n,m 

m(A)  =  ^  n(Anm). 

n,m 

All  the  properties  of  measures  established  above  can  obviously 
be  carried  over  to  this  case. 

3.  In  this  section  we  have  given  the  construction  of  Lebesgue 
measures  for  plane  sets.  Analogously  Lebesgue  measures  may  be 
constructed  on  a  line,  in  a  space  of  three  dimensions  or,  in  general, 
in  a  space  of  n  dimensions.  In  each  of  these  cases  the  measure  is 
constructed  by  the  same  method:  proceeding  from  a  measure  de- 
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fined  earlier  for  some  system  of  simple  sets  (rectangles  in  the 
case  of  a  plane,  intervals  (a,  b),  segments  [a,  6]  and  half-lines 
(a,  6]  and  [a,  b )  in  the  case  of  a  line,  etc.)  we  first  define  a  measure 
for  finite  unions  of  such  sets,  and  then  generalise  it  to  a  much 
wider  class  of  sets — to  sets  which  are  Lebesgue  measurable.  The 
definition  of  measurability  itself  can  be  carried  over  word  for 
word  to  sets  in  spaces  of  any  dimension. 

4.  Introducing  the  concept  of  the  Lebesgue  measure  we 
started  from  the  usual  definition  of  an  area.  The  analogous  con¬ 
struction  for  one  dimension  is  based  on  the  concept  of  length  of 
an  interval  (segment,  half-line).  One  can  however  introduce 
measure  by  a  different  and  more  general  method. 

Let  F(t)  be  some  non-decreasing  function,  continuous  from  the 
left,  and  defined  on  a  line.  We  set 

m(a,  b )  =  F(b)  —  F(a  +  0), 

m[a,  =  F(b  +  0)  —  F (a), 

m(a,  6]  =  F(b  +  0)  —  F(a  +  0), 

m[a,  b )  =  F(b)  —  F(a). 

It  is  easy  to  see  that  the  function  of  the  interval  m  defined  in  this 
way  is  non-negative  and  additive.  Applying  to  it  considerations 
which  are  analogous  to  the  ones  used  in  this  section,  we  can  con¬ 
struct  some  measure  /xf(A).  Here  the  set  of  sets,  which  are  meas¬ 
urable  with  respect  to  the  given  measure,  is  closed  with  respect  to 
the  operations  of  countable  unions  and  intersections,  and  the 
measure  mf  is  <r-additive.  The  class  of  sets,  measurable  with  re¬ 
spect  to  mf,  will,  generally  speaking,  depend  on  the  choice  of  the 
function  F.  However,  for  any  choice  of  F,  the  open  and  closed 
sets,  and  therefore  all  their  countable  unions  and  intersections 
will  obviously  be  measurable.  Measures  obtained  from  some 
function  F  are  called  Lebesgue-Stieltjes  measures.  In  particular, 
to  the  function  F(t)  =  t  there  corresponds  the  usual  Lebesgue 
measure  on  a  line. 

If  the  measure  nF  is  such  that  it  is  equal  to  zero  for  any  set 
whose  normal  Lebesgue  measure  is  zero,  then  the  measure  mf  is 
called  completely  continuous.  If  the  measure  is  wholly  concen- 
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trated  on  a  finite  or  countable  set  of  points  (this  happens  when 
the  set  of  values  of  the  function  F(t)  is  finite  or  countable),  then 
it  is  called  discrete.  The  measure  y.F  is  called  singular,  if  it  is  equal 
to  zero  for  any  one  point  set,  and  if  there  exists  such  a  set  M 
with  Lebesgue  measure  zero,  that  the  measure  nF  of  its  comple¬ 
ment  is  equal  to  zero. 

One  can  show  that  any  measure  nF  can  be  represented  as  a 
sum  of  an  absolutely  continuous,  a  discrete  and  a  singular  measure. 

•  Existence  of  Non-measurable  Sets 

It  has  been  shown  above  that  the  class  of  Lebesgue  measurable  sets 
is  quite  wide.  Naturally  the  question  arises  as  to  whether  there  exist 
any  non-measurable  sets  at  all.  We  shall  show  that  this  problem  is 
solved  affirmatively.  It  is  easiest  to  constmct  non-measurable  sets  on  a 
circle. 

Consider  a  circle  with  circumference  C  of  unit  length  and  let  a  be 
some  irrational  number.  Let  us  consider  those  points  of  the  circum¬ 
ference  C  which  can  be  transformed  into  each  other  by  a  rotation  of  the 
circle  through  an  angle  na  (n  an  integer)  as  belonging  to  our  class.  Each 
of  these  classes  will  obviously  consist  of  a  countable  number  of  points. 
Let  us  now  select  from  each  of  these  classes  one  point.  We  shall  show 
that  the  set  obtained  in  this  manner,  let  us  call  it  <h,  is  non-measurable. 
Let  us  call  the  set,  obtained  from  $  by  a  rotation  through  angle  na, 
<!>„.  It  is  easily  seen  that  all  the  sets  <f>„  are  pairwise  nonintersecting,  and 
that  they  add  up  to  the  whole  circular  arc  C.  If  the  set  $  were  meas¬ 
urable,  then  the  sets  <b„,  congruent  to  it,  would  also  be  measurable.  Since 

oo 

C  =  U  $n,  n  $„  =  0  for  ti  ^  m, 

n— — oo 

this  would  imply,  on  the  strength  of  the  <7-additivity  of  measures  that, 

1  =  £  m(*»).  (17) 

71= — OO 

But  congruent  sets  must  have  the  same  measure: 

m(3>«)  =  M  (•*>)• 

This  shows  that  equation  (17)  is  impossible,  since  the  sum  of  the  series 
on  the  right-hand  side  of  equation  (17)  is  equal  to  zero  if  n($)  =  0, 
and  is  infinite  if  n($)  >  0.  Thus  the  set  3>  (and  hence  also  each  3>n)  is 
non-measurable. 
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2.  Systems  of  Sets 

Before  proceeding  to  the  general  theory  of  sets  we  shall  first 
give  some  information  concerning  systems  of  sets,  which  sup¬ 
plements  the  elements  of  set  theory  discussed  in  Chapter  I  of 
Volume  I. 

We  shall  call  any  set,  the  elements  of  which  are  again  certain 
sets,  a  system  of  sets.  As  a  rule  we  shall  consider  systems  of  sets, 
each  of  which  is  a  subset  of  some  fixed  set  X.  We  shall  usually 
denote  systems  of  sets  by  Gothic  letters.  Of  basic  interest  to  us 
will  be  systems  of  sets  satisfying,  with  respect  to  the  operations 
introduced  in  Chapter  I,  §1  of  Volume  I,  certain  definite  condi¬ 
tions  of  closure. 

Definition  1 .  A  non-empty  system  of  sets  9f  is  called  a  ring  if  it 
satisfies  the  conditions  that  A  6  9?  and  B  £  9?  implies  that  the 
sets  A  A  B  and  A  fl  B  belong  to  9?. 

For  any  A  and  B 

Au  B  =  (A  A  B)  A  (A  D  B), 

and 

A\B  =  A  A(infi); 

hence  A  £  9J  and  B  £  9f  also  imply  that  the  sets  A  u  B  and 
A\B  belong  to  9?.  Thus  a  ring  of  sets  is  a  system  which  is  in¬ 
variant  with  respect  to  the  operations  of  union  and  intersection, 
subtraction  and  the  formation  of  a  symmetric  difference.  Ob¬ 
viously  the  ring  is  also  invariant  with  respect  to  the  formation  of 
any  finite  number  of  unions  and  intersections  of  the  form 

C  =  U  Ak,  D  =  f\  Ak. 

A=*l  k=l 

Any  ring  contains  the  empty  set  0,  since  always  A  \A  =  0. 
The  system  consisting  of  only  the  empty  set  is  the  smallest 
possible  ring  of  sets. 

A  set  E  is  called  the  unit  of  the  system  of  sets  @,  if  it  belongs 
to  ©  and  if,  for  any  A  £  ©,  the  equation 


holds. 


A  n  E  =  A 
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Hence  the  unit  of  the  system  of  sets  ©  is  simply  the  maximal 
set  of  this  system,  containing  all  other  sets  which  belong  to 

A  ring  of  sets  with  a  unit  is  called  an  algebra  of  sets. 

Examples.  1)  For  any  set  A  the  system  ©2(A)  of  all  its  subsets 
is  an  algebra  of  sets  with  the  unit  E  =  A. 

2)  For  any  non-empty  set  A  the  system  {  0,  A]  consisting  of 
the  set  A  and  the  empty  set  0,  forms  an  algebra  of  sets  with  the 
unit  E  =  A. 

3)  The  system  of  all  finite  subsets  of  an  arbitrary  set  A  forms 
a  ring  of  sets.  This  ring  is  then  and  only  then  an  algebra  if  the 
set  A  itself  is  finite. 

4)  The  system  of  all  bounded  subsets  of  the  line  forms  a  ring 
of  sets  which  does  not  contain  a  unit. 

From  the  definition  of  a  ring  of  sets  there  immediately  follows 

Theorem  1 .  The  intersection  32  =  f|  dt„  of  any  set  of  rings  is  also 
a  ring. 

Let  us  establish  the  following  simple  result  which  will  however 
be  important  in  the  subsequent  work. 

Theorem  2.  For  any  non-empty  system  of  sets  ©  there  exists  one 
and  only  one  ring  92  (@),  containing  ©  and  contained  in  an  arbi¬ 
trary  ring  92  which  contains  ©. 

Proof.  It  is  easy  to  see  that  the  ring  32  (@)  is  uniquely  defined 
by  the  system  ©.To  prove  its  existence  let  us  consider  the  union 
X  =  U  A  of  all  sets  A  contained  in  ©  and  the  ring  ©2  (X)  of  all 

Ae<& 

subsets  of  the  set  X.  Let  2  be  the  set  of  all  rings  of  sets  contained 
in  ©2  (X)  and  containing  ©.  The  intersection 

v  =  n  92 

SeS 


of  all  these  rings  will  obviously  be  the  required  ring  92(©). 
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Indeed,  whatever  the  91*  containing  ©,  the  intersection 
91  =  91*  n  ©1  ( X )  is  a  ring  of  2  and,  therefore, 

©  c  <3  c  9?  c  9?*, 

i.e.,  really  satisfies  the  requirement  of  being  minimal.  91(0)  is 
called  the  minimal  ring  over  the  system  0. 

The  actual  construction  of  the  ring  91(0)  for  a  given  system  © 
is,  generally  speaking,  fairly  complicated.  However,  it  becomes 
quite  straightforward  in  the  important  special  case  when  the 
system  ©  is  a  “semiring”. 

Definition  2.  A  system  of  sets  0  is  called  a  semiring  if  it  contains 
the  empty  set,  is  closed  with  respect  to  the  operation  of  intersec¬ 
tion,  and  has  the  property  that  if  A  and  i,  c  i  belong  to  ©, 

n 

then  A  can  be  represented  in  the  form  A  =  U  Ak,  where  the  Ak 

1 

are  pairwise  non-intersecting  sets  of  ©,  the  first  of  which  is  the 
given  set  Au 

In  the  following  pages  we  shall  call  each  system  of  non-inter¬ 
secting  sets  Ai,  A 2,  •  •  •,  A„,  the  union  of  which  is  the  given  set  A, 
&  finite  decomposition  of  the  set  A. 

Every  ring  of  sets  91  is  a  semiring  since  if  A  and  A,  Cl  A  belong 
to  9f,  then  the  decomposition 

A  —  Ai  u  A 2, 

where 

Ai  =  A\A;  €  91, 

takes  place. 

As  an  example  of  a  semiring  which  is  not  a  ring  of  sets  we  can 
take  the  set  of  all  intervals  (a,  b ),  segments  [a,  6]  and  half  seg¬ 
ments  (a,  6]  and  [a,  b )  on  the,real  axis.* 

In  order  to  find  out  how  the  ring  of  sets  which  is  minimal  over 
a  given  semiring  is  constructed,  let  us  establish  some  properties 
of  semirings  of  sets. 

*  Here,  of  course,  the  intervals  include  the  “empty”  interval  (a,  a)  and  the  segments 
consisting  of  one  point  [a,  a]. 
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Lemma  1.  Let  the  sets  Ah  A2,  •  An)  A  belong  to  the  semiring 
©,  and  let  the  sets  A  ,•  be  pairwise  non-intersecting  and  be  subsets  of 
the  set  A.  Then,  the  sets  A{(i  =  1,  2,  •  •  • ,  n)  can  be  included  as 
the  first  n  members  of  the  finite  decomposition 

s 

A  =  U  At,  s  >  n, 

k- 1 

of  the  set  A,  where  all  the  C  ©. 

Proof.  The  proof  will  be  given  by  induction.  For  n  =  1  the 
statement  of  the  lemma  follows  from  the  definition  of  a  semiring. 
Let  us  assume  that  the  result  is  true  for  n  =  m  and  let  us  con¬ 
sider  m  +  1  sets  A,,  A2,  •  •  • ,  Am,  Am+I  satisfying  the  conditions 
of  the  lemma.  By  the  assumptions  made, 

A  =  Ai  u  A2  u  •  •  •  u  Am  u  u  B2  u  •  •  •  u  Bv, 

where  the  sets  Bq  (q  =  1,  2,  •••,/>)  belong  to  ©.  Set 

Bq\  Am+1  Bq. 

By  the  definition  of  a  semiring,  we  have  the  decomposition 
Bq  =  Bql  U  Bqi  U  •  •  •  U  Bqr„ 

where  all  the  Bqj  belong  to  @.  It  is  easy  to  see  that 

V  rq 

A  =  Ai  u  •  •  •  uimu  Am+1  u  U  U  Bqj. 

g=l  7=2 

Thus  the  assertion  of  the  lemma  is  proved  for  n  =  m  +  1,  and 
hence  for  all  n. 

Lemma  2.  Whatever  the  finite  system  of  sets  Ah  A 2,  •  •  •,  An  be¬ 
longing  to  the  semiring  may  be,  one  can  find  in  @  a  finite  system  of 
pairwise  non-intersecting  sets  Bh  B2,  •  ■  ■ ,  Bt  such  that  each  Ak  can 
be  represented  as  a  union 

Ak  =  U  Bs 

of  some  of  the  sets  B,. 

Proof.  For  n  =  1  the  lemma  is  trivial  since  it  suffices  to  set 
t  =  1,  Bi  =  Ai.  Assume  that  it  is  true  for  n  =  m  and  consider 
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in  ©  some  system  of  sets  Alt  A2,  •  •  ■ ,  Am+1.  Let  Bh  Bh  •  •  •,  Bt  be 
sets  from  ©,  satisfying  the  conditions  of  the  lemma  with  respect 
to  Au  A 2,  •  •  •,  Am.  Let  us  set 

Bsl  =  Am+l  n  Ba. 

By  Lemma  1,  we  have  the  decomposition 

Am+ 1  =  U  B,i  u  U  BP',  Bp'  G  ©,  (1) 

s=l  p= 1 

and  by  the  definition  of  the  semiring  itself  we  have  the  decom¬ 
position 

Bs  =  B„ i  u  Ba 2  u  •  •  •  u  Btj, ,  Btq  G  ©  . 

It  is  easy  to  see  that 

ft 

Ak  =  U  \JBaq,  k  =  l,  2,  •  ■  ■ ,  m, 

8eM k  q= 1 

and  that  the  sets 

Bsq,  Bp' 

are  pairwise  non-intersecting.  Thus  the  sets  Bag,  Bv'  satisfy  the 
conditions  of  the  lemma  with  respect  to  A  lj  A  2}  *  *  *)  A  mi  Am±l. 
The  lemma  is  thus  proved. 

Theorem  3.  If  ©  is  a  semiring,  then  9 ?(©)  coincides  with  the 
system  3  of  the  sets  A,  which  admit  of  the  finite  decompositions 

A  =  U  Ak 

k= 1 

into  the  sets  Ak  G  ©. 

Proof.  Let  us  show  that  the  system  3  forms  a  ring.  If  A  and 
B  are  two  arbitrary  sets  in  3>  the  decompositions 

n  m 

A  =  U  Ak,  B  =  U  Bk,  Ak  G  ©,  Bk  G  ©, 

k=  1  lfc-1 

take  place.  Since  ©  is  a  semiring,  the  sets 


Cu  =  AiC\  B, 
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also  belong  to  ©.  By  Lemma  1  we  have  the  decompositions 

At  =  U  Cu  u  0  Dik,  Bj  =  U  Cti  u  0  (2) 

j  k=  1  i  k= 1 

where  Dik,  Ejk  £  ©.  From  equation  (2)  it  follows  that  the  sets 
A  n  B  and  A  A  B  admit  of  the  decompositions 

A  n  B  =  U  C,„ 

i.J 

A  A  B  =  U  Dik  u  U  Ejk, 

i,k  j.k 

and  therefore  belong  to  3-  Hence,  3  is  indeed  a  ring;  the  fact 
that  it  is  minimal  among  all  the  rings  containing  ©  is  obvious. 

In  various  problems,  in  particular  in  measure  theory,  one  has 
to  consider  unions  and  intersections  of  not  only  finite,  but  also 
infinite  numbers  of  sets.  Therefore  it  is  useful  to  introduce,  in 
addition  to  the  concept  of  a  ring  of  sets,  the  following  concepts. 

Definition  3.  A  ring  of  sets  is  called  a  c r-ring,  if  with  each  sequence 
of  sets  Ah  A 2,  •••,  i„,  •  •  •  it  also  contains  the  union 

S  =  U  An. 

n 

Definition  4.  A  ring  of  sets  is  called  a  6-ring,  if  in  addition  to 
each  sequence  of  sets  Ah  A  2,  •  •  • ,  A„,  •  •  •  it  also  contains  the 
intersection 

D  =  n  An. 

n 

It  is  natural  to  call  a  o--ring  with  a  unit  a  o-algebra,  and  a 
5-ring  with  a  unit  a  5-algebra.  However,  it  is  easy  to  see  that 
these  two  concepts  coincide :  each  c-algebra  is  at  the  same  time  a 
5-algebra,  and  each  5-algebra  is  a  o--algebra.  This  follows  from 
the  duality  relations 

U  An  =  E\f)(E\An), 

n  n 

n  An  =  E\U(E\An) 

n  n 

(see  Chapter  1,  §1  of  Volume  I).  The  5-algebras,  or  what  is  the 
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same  thing,  the  <r-algebras,  are  usually  called  Borel  algebras,  or 
simply  B-algebras. 

The  simplest  example  of  a  5-algebra  is  the  set  of  all  subsets  of 
some  set  A. 

A  theorem,  analogous  to  Theorem  2  proved  above  for  rings, 
holds  for  5-algebras. 

Theorem  4.  For  any  non-empty  system  of  sets  @  there  exists  a 
B-algebra  58  (@),  containing  @  and  contained  in  any  B-algebra 
which  contains  @. 

The  proof  follows  exactly  the  same  lines  as  does  the  proof  of 
Theorem  2.  The  5-algebra  58  (@)  is  called  the  minimal  B-algebra 
over  the  system  <&  or  the  Borel  closure  of  the  system  <5. 

So  called  Borel  sets  or  5-sets  play  an  important  role  in  analysis. 
These  sets  can  be  defined  as  sets  on  the  real  axis  belonging  to 
the  minimal  5-algebra  over  the  set  of  all  segments  [a,  6]. 

As  a  supplement  to  the  information  given  in  Chapter  1,  §7 
of  Volume  I,  let  us  note  the  following  facts  which  we  shall  need 
in  Chapter  II. 

Let  y  =  f(x)  be  a  function  defined  on  the  set  M  with  values 
from  the  set  N.  Let  us  denote  the  system  of  all  maps  /( A )  of  sets 
from  the  system  9ft  (we  assume  that  SO1?  consists  of  subsets  of  the 
set  M)  by  /(9ft)  and  the  system  of  all  subimages  f~'(A)  of  sets 
from  91  (we  assume  that  91  consists  of  subsets  of  the  set  N )  by 
/_1(9l).  The  following  statements  are  true: 

1)  If  91  is  a  ring,  then  /_1(9t)  is  a  ring. 

2)  If  91  is  an  algebra,  then  /_1(9(1)  is  an  algebra. 

3)  If  91  is  a  5-algebra,  then /-1(9 1)  is  a  5-algebra. 

'4)  5R(/-1(91))  =  /-*(»(  91)). 

5)  58  (/-1  (91))  =/-'(  58(91)). 

Let  ft  be  some  ring  of  sets.  If  in  it  we  take  the  operation  A  A  5  to 
be  “addition”  and  A  D  B  to  be  “multiplication”,  then  ft  is  a  ring  in  the 
usual  algebraic  sense  of  the  word.  All  its  elements  will  satisfy  the  con¬ 
ditions 

a  +  a  =  0,  a2  =  a.  ( * ) 
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Rings  whose  elements  satisfy  condition  (>|<)  are  called  “Boolian”  rings. 
Each  Boolian  ring  can  be  realised  as  a  ring  of  sets  with  the  operations 
A  A  B  and  A  fl  B  (Stone). 


3.  Measures  on  Semirings.  Continuation  of  a  Measure  from  a 
Semiring  to  the  Minimal  Ring  over  it 

In  §1,  when  considering  measure  in  the  plane,  we  started  from 
the  measure  of  a  rectangle  (area)  and  then  extended  the  concept 
of  measure  to  a  wider  class  of  sets.  The  results  as  well  as  the 
methods  given  in  §1  have  a  quite  general  character  and  can  be 
generalized  to  measures  defined  on  arbitrary  sets  without  essen¬ 
tial  changes.  The  first  step  in  constructing  a  measure  on  a  plane 
consisted  in  generalizing  the  concept  of  measure  from  rectangles 
to  elementary  sets,  i.e.,  to  finite  systems  of  pairwise  non-inter¬ 
secting  rectangles. 

In  this  section  we  shall  consider  the  abstract  analogue  of  this 
problem. 

Definition  1.  The  set  function  v(A')  is  called  a  measure  if: 

1)  its  domain  of  definition  S„  is  a  semiring  of  sets; 

2)  its  values  are  real  and  non-negative; 

3)  it  is  additive,  i.e.,  for  any  finite  decomposition 

A  =[}Ak 

of  the  set  i  (  S,  into  sets  Ak  £  SM,  the  equation 

m(4)  = 

holds. 

Remark.  From  the  decomposition  0=0U0  it  follows  that 

m(0)  =  2m(0),  i.e.,  m(0)  =  o. 

The  following  two  theorems  about  measures  on  semirings  will 
be  frequently  used  in  the  subsequent  pages. 

Theorem  1 .  Let  n  be  a  measure  defined  on  some  semiring  <S„.  If  the 
sets  Ax,  A  2,  •  •  ■,  An,  A  belong  to  Sf,  where  the  Ak  are  pairwise  non- 
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intersecting  and  all  belong  to  A,  then 

Z  fi(Ak)  <  n(A). 

b**  1 

Proof.  Since  is  a  semiring  there  exists,  according  to  Lemma  1, 
§2,  the  decomposition 

g 

A  ==  U  Ak)  S  ^  71  ,  Ak  ^  $fi) 

k=l 

where  the  first  n  sets  coincide  with  the  given  sets  Ah  A2,  •  •  •,  An. 
Since  the  measure  of  any  set  is  non-negative, 


ZmU*)  <  EmW  =  m(A). 


*-i 


Theorem  2.  If  Ah  A2,  An,  A  belong  to  S M  and  i  c  | J  A*,  then 

k=  1 


m(A)  <  EmU*). 

k=i 

Proof.  By  Lemma  2,  §2,  one  can  find  a  system  of  pairwise  non¬ 
intersecting  sets  B i,  jB2,  Bt  from  £„,  such  that  each  of  the 
sets  Ah  A 2,  A A  can  be  represented  as  a  union  of  some  of 
the  sets  B,  : 

A  =  U  B,,  Ak  =  U  B,,  k  =  1,2,  •  -  - ,  n. 

se3/0  se.W* 

Moreover,  each  index  s  £  M„  also  belongs  to  some  member  of 
Mk.  Therefore  each  term  of  the  sum 

Z  *(£,)  =  m(4) 

enters  once,  or  at  most  a  few  times,  into  the  double  sum 

z  z  m(b.)  =  i>u*). 

A=1  /c=l 
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This  yields 

v(A)  <  DmW. 

k= 1 

In  particular,  for  n  =  1,  we  have  the 
Corollary.  If  A  C  A',  then  n(A)  <  n(A'). 

Definition  2.  The  measure  ft(A)  is  called  the  continuation  of  the 
measure  m{A)  if  Sm  C  S„  and  if,  for  every  A  €  Sm,  the  equality 

ix(A)  =  m(A) 

holds. 

The  main  aim  of  the  present  section  is  the  proof  of  the  follow¬ 
ing  proposition. 

Theorem  3.  Every  measure  m(A)  has  one  and  only  one  continua¬ 
tion  ix(  A),  having  as  its  domain  of  definition  the  ring  9t(Sm). 

Proof.  For  each  set  A  6  9t(Sm)  there  exists  a  decomposition 

A  =  U  Bk ,  Bke  Sm,  (1) 

k= 1 

(Theorem  3,  §2) .  Let  us  assume  by  definition 

n(A)  =  !>(&).  (2) 

k=l 

It  is  easy  to  see  that  the  quantity  ju(A),  given  by  equation  (2), 
does  not  depend  on  the  selection  of  the  decomposition  (1). 
Indeed,  let  us  consider  the  two  decompositions 

m  n 

A  =  U  Bi  =  U  ci}  Bi  e  sm,  c,  e  sm. 

i=l  j=l 

Since  all  intersections  Bi  n  C}  belong  to  Sm,  we  have,  because  of 
the  additivity  of  measures, 

m  n 

=  £  £  m{Bi  n  Cf)  =  £  «(<?,), 

i=l  J=l 
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which  was  to  be  proved.  The  fact  that  the  function  n(A),  given 
by  equation  (2)  is  non-negative  and  additive  is  obvious.  Hence 
the  existence  of  a  continuation  n(A)  of  the  measure  m( A )  is 
shown.  To  show  its  uniqueness  let  us  note  that,  by  definition  of 

n 

continuation,  if  A  =  U  Bk,  where  Bk  are  non-intersecting  sets 

k=l 

from  Sm,  then  for  any  continuation  p*  of  the  measure  m  onto  the 
ring  9t(Sm) 

M*(A)  =  I >*(&)  =  I lm(Bt)  =  mU), 

i.e.,  the  measure  p*  coincides  with  the  measure  p  defined  by 
equation  (2) .  The  theorem  is  proved. 

The  connection  between  this  theorem  and  the  constructions  of 
§1  will  be  completely  clear  if  we  note  that  the  set  of  rectangles 
in  the  plane  is  a  semiring,  the  area  of  these  rectangles  is  a  measure 
in  the  sense  of  Definition  1,  and  the  elementary  plane  sets  form  a 
minimal  ring  over  the  semiring  of  the  rectangles. 

4.  Continuations  of  Jordan  Measures* 

In  the  present  section  we  shall  consider  the  general  form  of 
that  process  which  in  the  case  of  plane  figures  allows  one  to 
generalise  from  the  definition  of  areas  for  a  finite  union  of  rec¬ 
tangles,  with  sides  parallel  to  the  axes  of  coordinates,  to  areas  of 
all  those  figures  for  which  areas  are  defined  by  elementary  ge¬ 
ometry  or  classical  analysis.  This  extension  was  given  with 
complete  precision  by  the  French  mathematician  Jordan  around 
1880.  The  basic  idea  of  Jordan  goes  back,  incidentally,  to  the 
mathematicians  of  ancient  Greece  and  consists  of  approximating 
from  the  inside  and  from  the  outside  the  “measurable”  set  A  by 
sets  A'  and  A"  to  which  a  measure  has  already  been  prescribed, 
i.e.,  in  such  a  way  that  the  inclusions 

4'C4C  A" 

are  fulfilled. 

*  The  concept  of  a  Jordan  measure  has  a  definite  historical  and  methodological 
interest  but  is  not  used  in  this  exposition.  The  reader  may  omit  this  section  if  he 
wishes. 
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Since  we  can  continue  any  measure  onto  a  ring  (Theorem  3, 
§3),  it  is  natural  to  assume  that  the  initial  measure  m  be  defined 
on  a  ring  91  =  9t(Sm).  This  assumption  will  be  used  during  the 
whole  of  the  present  section. 

Definition  1 .  We  shall  call  a  set  A  Jordan  measurable  if,  for  any 
e  >  0,  there  exist  in  the  ring  3?  sets  A'  and  A"  which  satisfy 
the  conditions 

A'  C  A  C  A”,  m(A"\A')  <  t. 

Theorem  1.  The  system  91*  of  Jordan  measurable  sets  is  a  ring. 

Indeed,  let  A  £  91*,  B  6  9t*;  then,  for  any  s  >  0,  there  exist 
A',  A",  B',  B"  6  91  such  that 

A'CiC  A",  B'  C  B  C  B", 
and 

m(A"\A')  <  -£,  m(B"\B')  <  -S. 

Hence 

A'  u  B'  c  A  u  B  c  A"  u  B",  (1) 

A'\B"  C  A\B  C  A"\B'.  (2) 

Since 

(A"  u  B")\(A'  u  B ')  C  (T^XT')  u  (B"\B'), 

we  have 

m[_{A"  UB")\(A'  u  S')]  <  m[(T"\T')  u  (B"\B')f 

<  m(A"\A’)  +  m(B"\B')  =  s.  (3) 

Since 

(^"XBOXf^'XB")  £  (A"\jT)  u  (B"\B'), 

we  have 

m[_{A ,r\B') \ ( A r\B" ) J  <  m[(A"\A')  U  (B"\B')J 

<  tn(A"\A')  +  m(B"\B')  <  j  +  j  =  e.  (4) 
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Since  e  >  0  is  arbitrary,  and  the  sets  A'  u  B',  A"  u  B ",  A'\B" 
and  A"\B'  belong  to  9J,  (1),  (2),  (3)  and  (4)  imply  that 
A  u  B  and  A\B  belong  to  91*. 

Let  ©J  be  a  system  of  those  sets  A  for  which  the  set  B  2  A 
of  9J  exists.  For  any  A  from  ©1  we  set,  by  definition, 

P(A)  =  inf  m(B), 
l*(A)  =  sup  m(B). 

BSA 

The  functions  p(A)  and  fi(A)  are  called,  respectively,  the 
“outer”  and  the  “inner”  measure  of  the  set  A. 

Obviously,  always 

a(A)  <  p(A). 

Theorem  2.  The  ring  9?*  coincides  with  the  system  of  those  sets 
A  6  ©2  for  which  »(A)  =  p(A). 

Proof.  If 

fi(A)  n(A), 

then 

fi(A)  -  e(A)  =h>  0, 

and  for  any  A'  and  A"  from  9?  for  which  A'  c  A  c  A", 
m(A')  <  p(A),  m(A")  >  fi(A), 

m(A"\A')  =  m(A")  -  m(A')  >  h  >  0, 

i.e.,  A  cannot  belong  to  9?*. 

Conversely,  if 

ii(A)  =  P(A), 

then,  for  any  s  >  0,  there  exist  A'  and  A"  from  9i  for  which 

A'  c  A  C  A", 

a(A)  -  m(A')  <  |, 

m(A")  -  n(A)  <  |, 
m(A"\A')  =  m(A")  -  m(A')  <  e, 
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i.e., 

a  e  9f*. 

The  following  theorems  hold  for  sets  from  'M. 

n  n 

Theorem  3.  If  A  C  U  Ak,  then  ix{A)  <  23  ji{Ak). 

k=  1  *— 1 

Proof.  Let  us  select  Ak'  such  that 

AkQAk',  m(Ak')  <  p(Ak)  +  £, 

n 

and  let  us  form  A'  =  U  Ak  .  Then, 

k= 1 


m(A')  <  <  £  p(Ak)  +  e,  fi(A)  <  £  fi(At)  +  «, 


and  since  s  is  arbitrary,  jji(A)  <  23  P(Ak). 

k= 1 

Theorem  4.  //lt  Ci  (/c  =  1,  2,  •  •  •,  n)  and  A{  n  Aj  =  0,  then 
n(A)  > 

k=l 

Proof.  Let  us  select  Aa.'  c  A*  such  that 

>  fi(Ak)  -  ^  , 


n 

and  let  us  form  A'  =  U  Ak.  Then  A/  n  A/  =  0  and 

A  = 1 

m(A')  =  £  m(Ak)  >  +  e. 

A= 1  A 


m(4)  >  m(T')  >  £m(^U  -  e. 


k= 1 


Since  A'  c  A, 


4.  CONTINUATIONS  OF  JORDAN  MEASURES 


33 


Because  s  >  9  is  arbitrary, 

m(A)  >  EmW. 

*=1 

Let  us  now  define  the  function  m  with  the  domain  of  definition 

S„  =  ft* 

as  the  common  value  of  the  inner  and  outer  measures : 

m(A)  =  mU)  =  a  (A). 

Theorems  3  and  4,  and  the  obvious  fact  that  for  A  £  ft 
Vt{A)  =  m(A)  =  m(A), 

imply 

Theorem  5.  The  function  n(A)  is  a  measure  and  a  continuation  of 
the  measure  m. 

The  construction  given  can  be  used  for  any  measure  m  defined 
on  a  ring. 

The  system  S„,  =  @  of  elementary  sets  in  a  plane  is  essentially 
connected  with  the  coordinate  system:  the  sets  of  the  system  © 
consist  of  rectangles  with  sides  which  are  parallel  to  the  co¬ 
ordinate  axes.  In  going  over  to  the  Jordan  measure 

J(2)  =  j(m») 

this  dependence  on  the  choice  of  a  coordinate  system  disappears: 
starting  from  an  arbitrary  system  of  coordinates  \xh  x2\  connected 
with  the  initial  system  \xi,  x2\  by  the  orthogonal  transformation 

x2  =  cos  a-Xi  +  sin  a-x2  +  ai, 
x2  =  —sin  a-Xi  +  cos  a-x2  +  a2, 
we  obtain  the  same  Jordan  measure 

J  =  j{m  2)  =  j(m2) 

(here  m2  denotes  the  measure  constructed  with  the  help  of  rec¬ 
tangles  with  sides  parallel  to  the  axes  xu  x2).  This  fact  can  be 
proved  with  the  help  of  the  following  general  theorem: 

Theorem  6.  In  order  that  the  Jordan  continuations  tx\  =  j(m i)  and 
M2  =  j(m 2)  of  the  measures  mi  and  m2  defined  on  the  rings  fti  and 
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9?  2  coincide- ,  it  is  necessary  and  sufficient  that  the  following  condi¬ 
tions  be  satisfied: 

9Ji  €  <SM2,  mt(A)  =  in(A)  on  9h, 

%  £  Sn,  m2(A )  =  ih(A)  on  9f2. 

The  necessity  of  the  condition  is  obvious.  Let  us  prove  their 
sufficiency. 

Let  A  £  SMr  Then  there  exist  A',  A"  £  Sm„  such  that 
4'CAC  A",  m,(T")  -  mi  (T')  <  |, 

O 

and 

mi  (TO  <  /*i(T)  <  mi(T"). 

By  the  conditions  of  the  theorem,  n2(A')  =  mt  (T')  and  ix2  (A”)  = 
mi  {A"). 

From  the  definition  of  the  measure  m  it  follows  that  there  exist 
B'  £  Sm,  and  B"  £  Sm,  for  which, 

T'  2  B'  and  #t2(T')  —  m2(B')  <  |, 

u 

B"  2  A"  and  m2(B")  -  M2(T")  <  l 

O 

Here 

B'  c  T  c  B", 

and,  obviously, 

m2{B")  —  m2(B')  <  s. 

Since  e  >  0  is  arbitrary,  A  £  S^,  and  from  the  relations 
Mi(B0  =  rnfiB’)  <  W(T)  <  m*(fl")  =  Mi(B") 
it  follows  that 

m2(T)  =  ^(T). 

The  theorem  is  proved. 

To  establish  that  the  Jordan  measure  in  the  plane  is  inde¬ 
pendent  of  the  choice  of  the  system  of  coordinates,  one  need  only 


convince  oneself  that  a  set  which  is  obtained  from  an  elementary 
set  by  a  rotation  through  some  angle  a  is  Jordan  measurable.  It 
is  suggested  that  the  reader  do  this  for  himself. 

If  the  initial  measure  is  given,  not  on  a  ring,  but  on  a  semiring, 
then  it  is  natural  to  consider  as  its  Jordan  continuation  the 
measure 


j(m)  =  j(r(m)) , 

obtained  as  a  result  of  a  continuation  of  m  to  the  ring  9?  (Sm )  and 
a  subsequent  continuation. 


5.  Countable  Additivity.  General  Problem  of  Continuation 

of  Measures 

Often  one  must  consider  the  union  of  not  only  a  finite,  but  of 
a  countable  number  of  sets.  In  this  connection  the  condition  of 
additivity  which  we  have  imposed  on  measures  (Definition  1, 
§2)  turns  out  to  be  insufficient  and  it  is  natural  to  replace  it  by 
the  stronger  requirement  of  countable  additivity. 

Definition  1 .  The  measure  y.  is  called  countably  additive  (or 
a-additive) ,  if,  for  any  sets  A,  A.,  •  •  •,  A„,  ■  •  •,  belonging  to  its 
domain  of  definition  S„  and  satisfying  the  conditions 

A  =  U  An, 

n= 1 

Ai  n  A;  =  0  for  i  0  j, 

the  equality 

m(A)  =  £  n(An) 

n= 1 

holds.  The  plane  Lebesgue  measure  which  we  constructed  in  §1 
is  a--additive  (Theorem  9).  An  example  of  a  o--additive  measure 
of  a  completely  different  kind  can  be  constructed  in  the  following 
way.  Let 
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be  an  arbitrary  countable  set,  and  let  the  numbers  p„  >  0  be 
such  that 

Z)p»  =  !• 

The  domain  S „  consists  of  all  subsets  of  the  set  X.  For  each 
A  c  X  we  set 

/*M)  =  £  P»- 

x„eA 

It  is  easy  to  check  that  /i(  A )  is  a  <r-additive  measure,  where 
)x(X )  =  1.  This  example  occurs  naturally  in  connection  with 
many  questions  of  probability  theory. 

Let  us  give  an  example  of  a  measure  which  is  additive  but  not 
(r- additive.  Let  X  be  the  set  of  all  rational  points  of  the  segment 
[0,  1],  and  S„  consist  of  the  intersections  of  the  set  X  with 
arbitrary  intervals  (a,  b),  segments  [a,  6]  and  half  segments 
(a,  6],  [a,  b).  It  is  easy  to  see  that  *S„  is  a  semiring.  For  each 
such  set  we  put 

n(Aab)  =  b  —  a. 

This  is  an  additive  measure.  It  is  not  ^-additive  because,  for 
example,  n(  X )  =  1  and  at  the  same  time  X  is  the  union  of  a 
countable  number  of  separate  points,  the  measure  of  each  one  of 
which  is  zero. 

In  this  and  the  two  following  sections  we  shall  consider  (7-addi¬ 
tive  measures  and  their  different  ^-additive  continuations. 

Theorem  1 .  If  the  measure  m,  defined  on  some  semiring  Sm,  is 
countably  additive,  then  the  measure  n  =  r(m),  obtained  from  it  by 
continuation  to  the  ring  is  also  countably  additive. 

Proof.  Let 

A  e  Bn£3l(Sm),  n  =  1,2,  •••, 


A  =  0  Bn, 

n*= 1 


and 
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where  B,  D  Br  =  0  for  s  0  r.  Then  there  exist  sets  A,-  and  Bn%  from 
Sm  such  that 

,1  =  U  Aj,  Bn  =  u  Bni, 

i  i 

where  the  sets  on  the  right-hand  sides  of  each  of  these  equations 
are  pairwise  non-intersecting  and  the  union  over  i  and  j  is  finite. 
(Theorem  3,  §2). 

Let  Cnij  =  Bni  n  Aj.  It  is  easy  to  see  that  the  sets  Cnu  are  pair¬ 
wise  non-intersecting,  and  hence, 

/1,  =  UU  Cnii, 

n  i 

Bni  =  U  Cnii. 

i 

Therefore,  and  because  of  the  additivity  of  the  measure  m  on 
Sm,  we  have 

m(Aj)  =  12  22  m(Cnn),  (!) 

»  i 

m(Bnt)  =  £»i(Q,  (2) 

j 

and,  by  definition  of  the  measure  r{m)  on  9t(S„), 

M(A)=Zm(Ai),  (3) 

i 

m(B„)  =  12  m(Bn{).  (4) 

i 

Equations  (1),  (2),  (3)  and  (4)  imply  p(A)  =  12  n(Bn).  (The 

n 

summations  over  i  and  j  are  finite,  the  series  in  n  converge. ) 

One  could  show  that  a  Jordan  continuation  of  a  o-additive 
measure  is  always  o-- additive;  there  is  however  no  need  to  do  this 
in  this  special  case  since  it  will  follow  from  the  theory  of  Lebesgue 
continuations  which  will  be  given  in  the  next  section. 

Let  us  now  show  that,  for  the  case  of  <r-additive  measures, 
Theorem  2  of  §3  may  be  extended  to  countable  coverings. 

Theorem  2.  If  the  measure  »  is  a-additive,  and  the  sets  A,  Au 
A 2,  •  •  •,  An,  •  •  •  belong  to  SM,  then 

A  C  0  An 

n= 1 
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implies  the  inequality 


m(A)  <  Em(A„). 

n=l 

Proof.  By  Theorem  1,  it  is  enough  to  give  the  proof  for  measures 
defined  on  a  ring,  since  from  the  validity  of  Theorem  2  for  y  = 
r{m)  it  immediately  follows  that  it  can  be  applied  also  to  the 
measure  m.  If  S „  is  a  ring,  the  sets 

Bn  =  ( A  n 

k=l 

belong  to  S Since 


A  =  0  Bn,  B„  C  An, 

n= 1 

and  since  the  sets  Bn  are  pairwise  non-intersecting, 

m(A)  =  SmC Bn)  <  £m(A,). 

n— 1  n=l 

From  now  on  we  shall,  without  special  mention,  consider  only 
(r-additive  measures. 

We  have  already  considered  above  two  methods  of  continuation  of 
measures.  In  connection  with  the  continuation  of  the  measure  m  to  the 
ring  in  §3  we  noted  the  uniqueness  of  this  continuation.  The 

case  of  a  Jordan  continuation  j(m)  of  an  arbitrary  measure  m  is  analogous. 
If  the  set  A  is  Jordan  measurable  with  respect  to  the  measure  m  (belongs 
to  f(m) ) ,  then,  for  any  measure  y  continuing  m  and  defined  on  A,  the 
value  y(A )  coincides  with  the  value  J (A)  of  the  Jordan  continuation 
J  =  j(m').  One  can  show  that  the  extension  of  the  measure  m  beyond  the 
boundaries  of  the  system  Sym >  is  not  unique.  More  precisely  this  means 
the  following.  Let  us  call  the  set  A  the  set  of  uniqueness  for  the  measure 
m,  if: 

1)  there  exists  a  measure  which  is  a  continuation  of  the  measure  m, 
defined  for  the  set  A; 

2)  for  any  two  measures  of  this  kind  y,  and  yi 

Mi(d)  =  M2  (A). 

The  following  theorem  holds:  The  system  of  sets  of  uniqueness  for  the 
measure  m  coincides  with  the  system  of  sets  which  are  Jordan  measur¬ 
able  with  respect  to  the  measure  m,  i.e.,  with  the  system  of  sets  <S ,•<„). 
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However,  if  one  considers  only  cr-additive  measures  and  their  continua¬ 
tion  (o-additive),  then  the  system  of  sets  of  uniqueness  will  be,  generally 
speaking,  wider. 

Since  it  is  the  case  of  o-additive  measures  that  will  interest  us  in  the 
future  let  us  establish 

Definition  2.  The  set  A  is  called  the  set  of  o-uniqueness  for  a  0-addi- 
tive  measure  n,  if : 

1)  there  exists  a  o-additive  continuation  X  of  the  measure  m  defined 
for  A  (i.e.,  such  that  A  £  S\); 

2)  for  two  such  o-additive  continuations  Xi  and  X2  the  equation 

Xi(A)  =  X2(A) 

holds.  If  A  is  a  set  of  o-additivity  for  the  o-additive  measure  ju,  then,  by 
our  definition,  there  exists  only  one  possible  X(A)  for  the  o-additive 
continuation  of  the  measure  n,  defined  on  A. 


6.  Lebesgue  Continuation  of  Measure,  Defined  on  a  Semiring 

with  a  Unit 

Even  though  the  Jordan  continuation  allows  one  to  generalise 
the  concept  of  measure  to  quite  a  wide  class  of  sets,  it  still  remains 
insufficient  in  many  cases.  Thus,  for  example,  if  we  take  as  the 
initial  measure  the  area,  and  as  the  domain  of  its  definition  the 
semiring  of  rectangles  and  consider  the  Jordan  continuation  of 
this  measure,  then  even  such  a  comparatively  simple  set  as  the 
set  of  points,  the  coordinates  of  which  are  rational  and  satisfy 
the  condition  x2  +  y2  <  1,  is  not  Jordan  measurable. 

A  generalisation  of  a  o-additive  measure  defined  on  some  semi¬ 
ring  to  a  class  of  sets  which  is  maximal  in  the  well  known  sense 
can  be  obtained  with  the  help  of  the  so-called  Lebesgue  con¬ 
tinuation.  In  this  section  we  shall  consider  the  Lebesgue  con¬ 
tinuation  of  a  measure  defined  on  a  semiring  with  a  unit.  The 
general  case  will  be  considered  in  §7. 

The  construction  given  below  represents,  to  a  large  degree,  a 
repetition,  in  abstract  terms,  of  the  construction  of  the  Lebesgue 
measure  for  plane  sets  given  in  §1. 
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Let  a  0-additive  measure  m  be  given  on  some  semiring  of  sets 
Sm  with  unit  E.  We  shall  define  on  the  system  ©  of  all  subsets 
of  the  set  E  the  functions  and  t**(A)  in  the  following  way. 

Definition  1.  The  number 

H*(A)  =  inf 

A  £U#n  n 

n 

where  the  lower  bound  is  taken  over  all  coverings  of  the  set  A  by 
finite  or  countable  systems  of  sets  Bn  £  Sm,  is  called  the  outer 
measure  of  the  set  A  c  E. 

Definition  2.  The  number 

m*M)  =  m(E)  -  n*(E\A) 
is  called  the  inner  measure  of  the  set  A  Q  E. 

From  Theorem  2,  §3  it  follows  that  always  n*(A)  <  v*(A). 
Definition  3.  The  set  A  c  E  is  called  measurable  (Lebesgue),  if 
m*(A)  =  m*L4). 

If  A  is  measurable,  then  we  shall  denote  the  common  value 
m*(A)  =  n*(A)  by  n(A)  and  call  it  the  (Lebesgue)  measure  of 
the  set  A. 

It  is  obvious  that,  if  A  is  measurable,  then  its  complement  is 
altfo  measurable. 

Theorem  2,  §5  immediately  implies  that  for  any  0-additive 
continuation  n  of  the  measure  m  the  inequality 

M*(d)  <  m(A)  <  M*(d) 

holds.  Therefore,  for  a  measurable  set  A,  each  0-additive  con¬ 
tinuation  n  of  the  measure  m  (if  it  exists  at  all)  necessarily  equals 
the  common  value  m*M)  =  m*(A).  The  Lebesgue  measure  is 
nothing  but  the  0-additive  continuation  of  the  measure  m  to  the 
set  of  all  measurable  (in  the  sense  of  Definition  3)  sets.  The 
definition  of  measurability  can  obviously  also  be  formulated  in 
the  following  way: 

Definition  3'.  The  set  A  c  E  is  called  measurable,  if 
M*U)  +ix*(E\A)  =  m{E). 
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It  is  expedient  to  use,  aside  of  the  initial  measure  to,  its  con¬ 
tinuation  m!  =  r(  m )  onto  the  ring  3t  (Sm)  which  is  already  known 
to  us  (§3).  It  is  clear  that  the  following  definition  is  equivalent 
to  Definition  1. 

Definition  1 '.  The  number 

M*U)  =  inf  Zwi'(Bn'),  Bn'  C  M(Sn), 

Ae  u£„'  n 
n 

is  called  the  outer  measure  of  the  set  A. 

Indeed,  since  the  measure  to'  is  o--additive  (Theorem  1,  §5),  any 
sum  m'(Bn'),  where  BJ  £  3?  («Sm),  can  be  replaced  by  the  sum 

n 

^  '  m(Fnk)  ,  Bnk  £  Sm , 

n,k 

which  is  equal  to  it,  and  where  B„'  =  U  Bnk ,  Bni  n  Bnj  =  0  if 
i  *  j.  k 

The  following  are  basic  facts. 

Theorem  1.  If 

icUl,, 

where  (i„)  is  a  finite  or  countable  system  of  sets,  then 

<  I VMJ. 

n 


Theorem  2 .  If  A  6  31,  then  n*(A)  =  m’(A )  =  n*(A),  i.e.,  all 
the  sets  from  3?  ( Sm )  are  measurable,  and  for  them  the  inner  and 
outer  measures  coincide  with  to'. 

Theorem  3.  For  the  measurability  of  the  set  A  the  following  con¬ 
dition  is  necessary  and  sufficient:  for  any  s  >  0  there  exists  a 
B  6  31  (Sm )  such  that 

M*(d  A  B)  <  E. 

In  §1  these  statements  were  proved  for  the  plane  Lebesgue 
measure  (Theorems  3-5,  §1).  The  proofs  given  there  can  be 
carried  over  word  for  word  to  the  general  case  considered  here, 
therefore  we  shall  not  repeat  them. 
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Theorem  4.  The  system  ©f  of  all  measurable  sets  is  a  ring. 
Proof.  Since  it  is  always  true  that 

Ai  n  A2  =  -Ai\(.Ai\.i4.2) 


and 


AiU42  =  E\[(£\A0  n 

it  suffices  to  show  the  following.  If  Ax  £  ©f  and  A2  6  ©J,  then  also 

A  =  Ai\A»  e  SK. 

Let  Ax  and  A2  be  measurable;  then  there  exist  Bx  g  9t(Sm)  and 
B 2  €  dt(Sm)  such  that 


m*(Ai  ABi)  <  ^  and  n*(A2  A  B2)  <  A 

Setting  B  =  Bx\B2  €  9?(<Sm)  and  using  the  relation 
A  (ft\ft)  C  (Ai  A  B{)  u  (A2  A  ft), 

we  obtain 

n*(A  A  B)  <  e. 

Since  5  >  0  is  arbitrary,  this  implies  that  the  set  A  is  meas¬ 
urable. 

Remark.  Obviously  E  is  the  unit  of  the  ring  ©}  which,  therefore, 
is  an  algebra  of  sets. 

Theorem  5.  On  the  system  ©1  of  measurable  sets,  the  function 
n(A)  is  additive. 

The  proof  of  this  theorem  is  a  word  for  word  repetition  of  the 
proof  of  Theorem  7,  §1. 

Theorem  6.  On  the  system  ©1  of  measurable  sets,  the  function 
n(A)  is  a-additive. 

Proof.  Let 


CO 

A  =  U  A 

n= 1 


A  C  ©}, 


A  i  n  A  j  =  0  for  i  0  j. 
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By  Theorem  1, 

m*(A)  <  (1) 

» 

and  by  Theorem  5,  for  any  N 

r*U)  >  A  U  A.)  = 

\m=1  /  n= 1 

implying 

m*M)  >  EmU-)-  (2) 

n 

Inequalities  (1)  and  (2)  yield  the  assertion  of  the  theorem. 

Thus  we  have  established  that  the  function  n{A),  defined  on 
the  system  ©f,  possesses  all  the  properties  of  a  <r-additive  measure. 
Hence  the  following  definition  is  verified : 

Definition  4.  One  calls  the  function  n(A),  defined  on  a  system 
S?  =  2ft  of  measurable  sets,  and  coinciding  on  this  system  with 
the  outer  measure  n*(A),  the  Lebesgue  continuation  n  —  L(m ) 
of  the  measure  m(A). 

In  §1,  considering  the  plane  Lebesgue  measure,  we  have  shown 
that  not  only  the  finite  but  also  the  countable  unions  and  inter¬ 
sections  of  measurable  sets  are  also  measurable  sets.  This  is  true 
also  in  the  general  case  i.e.,  the  following  theorem  holds. 

Theorem  7.  The  system  ©}  of  Lebesgue  measurable  sets  is  a  Borel 
algebra  with  unit  E. 

Proof.  Since 

fl  A„  =  £r\U(£f\A,), 

n  n 

and  since  the  complement  of  a  measurable  set  is  measurable,  it 
suffices  to  show  the  following.  If  A1}  A2>  •  •  *,  A„,  •  •  •  belong  to  ©f, 
then  A  =  U  An  also  belongs  to  ©}.  The  proof  of  this  statement 

n 

given  in  Theorem  8,  §1,  for  plane  sets,  is  literally  preserved  also 
in  the  general  case. 
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Exactly  as  in  the  case  of  a  plane  Lebesgue  measure  its  e-addi¬ 
tivity  implies  its  continuity,  i.e.,  if  n  is  a  e-additive  measure, 
defined  on  a  5-algebra,  li  2  12  D  •••  2  A„  2  •  •  •  is  a  de¬ 
creasing  chain  of  measurable  sets  and 

A  =  ft  An, 


then 


n(A)  =  lim  n(A„), 

n-*oo 

and  if  4i  c  il,  c  ...  c  A„  c  •  •  •  is  an  increasing  chain  of 
measurable  sets  and 

A  =  U  An, 


then 


n(A)  =  lim  n(An). 

n-*  co 

The  proof  given  in  §1  for  a  plane  measure  (Theorem  10 )  can  be 
carried  over  to  the  general  case. 

1)  From  the  results  of  §§5  and  6  it  is  easy  to  deduce  that  every  set  A 
which  is  Jordan  measurable  is  Lebesgue  measurable;  moreover  its  Jordan 
and  Lebesgue  measures  are  equal.  This  immediately  implies  that  the 
Jordan  continuation  of  a  u-additive  measure  is  c-additive. 

2)  Every  set  A  which  is  Lebesgue  measurable  is  a  set  of  uniqueness 
for  the  initial  measure  m.  Indeed,  for  any  e  >  0  there  exists  for  A  a 
5^9 i  such  that  n*(A  A  B)  <  «.  Whatever  the  extension  X  of  the 
measure  m  may  be, 

X(5)  =  m'(5), 

since  the  continuation  of  the  measure  m  to  9i  =  9i  (Sm)  is  unique. 
Furthermore, 

\(A  A  B)  <  m*(A  A  B)  <  g, 

and  therefore 

|X(d)  -  tn'(B)  |  <  s. 

Thus  we  have,  for  two  arbitrary  continuations  Ai(T)  and  XofT)  of  the 
measure  m, 


Xi(T)  —  Xz(T)  |  <  2s, 
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which,  because  of  the  arbitrariness  of  e,  implies 

Xi  ( d. )  =  X2  ( d ) . 


One  can  show  that  the  system  of  Lebesgue  measurable  sets  exhausts  the 
whole  system  of  sets  of  uniqueness  for  the  initial  measure  m. 

3)  Let  m  be  some  <r-additive  measure  with  the  domain  of  definition 
S  and  let  ©1  =  L  (S)  be  the  domain  of  definition  of  its  Lebesgue  con¬ 
tinuation.  From  Theorem  3  of  this  section  it  easily  follows  that  whatever 
the  semiring  Si  for  which 

S  C  Si  C  ©l, 

we  always  have 


L(Si)  =  L(S). 


7.  Lebesgue  Continuation  of  Measures 
In  the  General  Case 

If  the  semiring  Sm  on  which  the  initial  measure  m  is  defined 
does  not  have  a  unit,  then  the  exposition  of  §6  must  be  slightly 
changed.  Definition  1  of  the  outer  measure  is  preserved,  but  the 
outer  measure  n*  turns  out  to  be  defined  only  on  the  system  <S„* 
of  such  sets  A  for  which  the  coverings  U  B„  by  sets  from  Sm  with 
a  finite  sum 

'Zm(Bn) 

n 

exists.  Definition  2  loses  its  meaning.  The  inner  measure  may  be 
defined  (in  a  slightly  different  way)  also  in  the  general  case,  but 
we  shall  not  go  into  this.  For  the  definition  of  measurability  of 
sets  it  is  expedient  to  take  now  the  property  of  measurable  sets 
implied  by  Theorem  3. 

Definition  1.  The  set  A  is  called  measurable,  if  for  any  e  >  0 
there  exists  a  set  B  £  Sm  such  that  n*(A  A  B)  <  s. 

Theorems  4,  5  and  6  and  the  final  Definition  4  stay  in  force.  In 
the  proofs  we  used  the  assumption  of  the  existence  of  a  unit  only 
in  proving  Theorem  4.  To  give  the  proof  of  Theorem  4  for  the 
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general  case  we  must  show  again  that  from  Al£  M,  A 2  d  M  it 
follows  that  Ax  u  A2  £  M.  This  proof  is  carried  out  exactly  as  for 
Ai\A2  on  the  basis  of  the  inclusion 

(Ti  u  Af)  A  ( Bi  u  Bf)  ^  ( Ai  A  Bi )  u  (A2  A  B 2). 

In  the  case  when  Sm  does  not  have  a  unit,  Theorem  7,  §6  is 
replaced  by  the  following: 

Theorem  1 .  For  any  initial  measure  m,  the  system  of  sets  SOI  =SL(m) 
which  are  Lebesgue  measurable  is  a  8-ring.  For  measurable  A„  the 

CO  /CD  \ 

set  A  =  U  An  is  measurable  if  and  only  if  the  measures  ^  I  U  A„  )  are 

n= 1  ' n=l  ' 

bounded  by  some  constant  which  does  not  depend  on  N. 

The  proof  of  this  assertion  is  left  to  the  reader. 

Remark.  In  our  exposition  the  measures  are  always  finite,  there¬ 
fore  the  necessity  of  the  last  condition  is  obvious. 

Theorem  1  implies  the  following 

Corollary.  The  system  Wa  of  all  sets  B  £  SO?  which  are  subsets  of 
a  fixed  set  A  £  SOI  forms  a  Borel  algebra. 

For  example,  the  system  of  all  Lebesgue  measurable  (in  the 
sense  of  the  usual  Lebesgue  measure  on  the  line)  subsets  of  any 
segment  [a,  6]  is  a  Borel  algebra  of  sets. 

In  conclusion  let  us  mention  one  more  property  of  Lebesgue 
measures. 

Definition  2.  The  measure  n  is  called  complete,  if  n{A)  =  0 
and  A'  c  A  imply  A'  £ 

It  is  obvious  that  here  n(A ')  =0.  Without  any  difficulty  one  can 
show  that  the  Lebesgue  continuation  of  any  measure  is  complete. 
This  follows  from  the  fact  that  for  A'  c  A  and  p{A )  =0  neces¬ 
sarily  ii*{A')  =  0,  and  any  set  C  for  which  n*(C)  =  0  is  meas¬ 
urable,  since  0  C  9?  and 


m*(CA  0)  =  „*(C)  =  0. 
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Let  us  point  out  the  connection  between  the  process  of  Lebesgue 
continuation  of  measures  and  the  process  of  completion  of  a  metric  space. 
For  this  let  us  note  that  m'(A  A  B)  can  be  taken  as  the  distance  between 
the  elements  A  and  B  of  the  ring  3t(Sm).  Then  9f(<Sm)  becomes  a  metric 
space  (generally  speaking  not  complete),  and  its  completion,  by  Theorem  3, 
§6,  consists  exactly  of  all  the  measurable  sets.  (Here,  however,  the 
sets  A  and  B  are  indistinguishable,  from  the  metric  point  of  view,  if 
n(A  A  B)  =  0.) 


CHAPTER  II 


MEASURABLE  FUNCTIONS 


8.  Definition  and  Basic  Properties  of  Measurable  Functions 

Let  X  and  Y  be  two  arbitrary  sets,  and  assume  that  two 
systems  of  subsets  ©  and  ©',  respectively,  have  been  selected 
from  them.  The  abstract  function  y  =  f(x),  with  the  domain  of 
definition  X,  taking  on  values  from  Y,  is  called  (©,  ©^—meas¬ 
urable  if  from  i  (  S'  it  follows  that  f~l(A)  £  ©. 

For  example,  if  we  take  as  X  and  Y  the  real  axis  D1  (i.e., 
consider  real  functions  of  a  real  variable),  and  as  ©  and  ©'  take 
the  system  of  all  open  (or  all  closed)  subsets  of  D1,  then  the  stated 
definition  of  measurability  reduces  to  the  definition  of  continuity 
(§12  of  Volume  I).  Taking  for  ©  and  ©'  the  system  of  all  Borel 
sets,  we  arrive  at  the  so-called  ^-measurable  (or  Borel  measur¬ 
able)  functions. 

In  what  follows  we  shall  be  interested  in  the  concept  of  meas¬ 
urability  mainly  from  the  point  of  view  of  the  theory  of  integra¬ 
tion.  Of  basic  importance  in  this  connection  is  the  concept  of 
^-measurability  of  real  functions,  defined  on  some  set  X,  where 
one  takes  for  ©  the  system  of  all  ^-measurable  subsets  of  the  set 
X  and  for  ©'  the  set  of  B- sets  on  the  straight  line.  For  simplicity, 
we  shall  assume  that  X  is  the  unit  of  the  domain  of  definition 
of  the  measure  y.  Since,  according  to  the  results  of  §6,  every 
(r-additive  measure  can  be  continued  to  some  Borel  algebra,  it  is 
natural  to  assume  from  the  beginning  that  <S„  is  a  B-algebra. 
Therefore  we  shall  formulate  the  definition  of  measurability  for 
real  functions  in  the  following  way: 

Definition  1.  The  real  function  /( x),  defined  on  the  set  X,  is 
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called  ^-measurable  if  for  any  Borel  set  A  of  the  real  line 

f-'(A)  e  sM. 

Let  us  denote  the  set  of  those  x  6  X  for  which  condition  Q  is 
satisfied  by  {t  :  Q). 

Theorem  1.  In  order  that  the  function  f(x)  be  n~ measurable ,  it  is 
necessary  and  sufficient  that  for  any  real  C  the  set  {x  :  f(x)  <  c] 
be  m- measurable  (i.e.,  belongs  to  S»). 

Proof.  The  necessity  of  the  condition  is  clear,  since  the  half  line 
( -  oo ,  c)  is  a  Borel  set.  To  prove  the  sufficiency  let  us  first  of  all 
note  that  the  Borel  closure  B  (s )  of  the  system  2  of  all  half  lines 
(-  oof  c)  coincides  with  the  system  B'  of  all  Borel  sets  of  the 
real  axis.  By  assumption,  f~'(  2)  Q  iS„.  But  then 

/-CB(2))  =  B(/-'(2))  C  B(SA, 

and,  since  by  assumption  S „  is  a  B-algebra,  B(SM)  =  <SM.  The 
theorem  is  thus  proved. 

Theorem  2.  The  limit  of  a  sequence  of  \i~measurable  functions 
which  converges  for  every  x  £  X  is  n-measurable. 

Proof.  Let  f„{x)  —>f(x),  then 

\x:f(x)  <  c)  =  U  U  fl  \x:fm(x)  <  c  -  H.  (1) 

k  n  m>n  [  AjJ 

Indeed,  if  f(x)  <  c,  then  there  exists  a  k,  such  that  f(x)  < 
c  —  2/k)  moreover,  for  this  k  one  can  find  an  n  large  enough  so 
that  for  m  >  n  the  inequality 

fm(x)  <  C  - 

is  satisfied,  and  this  means  that  x  will  enter  the  right-hand  side 
of  (1). 
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Conversely,  if  x  belongs  to  the  right-hand  side  of  (1 ),  then 
there  exists  a  k,  such  that,  for  all  sufficiently  large  to, 

,  ,  .  1 
U(x)  <  c  - 

k 

but  then/ (x)  <  c;  i.e.,  x  enters  the  left-hand  side  of  equation  (1 ). 
If  the  functions  /„  ( x )  are  measurable,  the  sets 

•  x:fm(x)  <  c  —  7 
k 


belong  to  S„.  Since  <S„  is  a  Borel  algebra,  the  set 

\x:f(x)  <  c) 


also  belongs  to  SM,  by  (1),  which  proves  that/(z)  is  measurable. 

For  the  further  study  of  measurable  functions  it  is  convenient 
to  represent  each  of  them  as  a  limit  of  a  sequence  of  so-called 
simple  functions. 

Definition  2.  The  function  f(x)  is  called  simple  if  it  is  ^-measur- 
able  and  takes  on  not  more  than  a  countable  number  of  values. 

It  is  clear  that  the  concept  of  a  simple  function  depends  on  the 
choice  of  the  measure  y. 

The  structure  of  simple  functions  is  characterised  by  the  follow¬ 
ing  theorem : 


Theorem  3.  The  function  f(x),  taking  on  not  more  than  a  countable 
number  of  values 

Vh  Vi,  •  —  •  •  • , 

is  ^-measurable  if  and  only  if  all  the  sets 


are  n-measurable. 


An  =  \x:f{x)  -  y„} 


Proof.  The  necessity  of  the  condition  is  obvious,  since  every  A„ 
is  the  inverse  image  of  a  set  of  one  point  {y„},  and  every  set  of 
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one  point  is  a  Borel  set.  Sufficiency  follows  from  the  fact  that  by 
the  conditions  of  the  theorem  the  inverse  image  f~'(B)  of  any 
set  BCD1  is  the  union  U  An  of  not  more  than  a  countable 

yni.  B 

number  of  measurable  sets  An,  i.e.,  is  measurable. 

The  further  use  of  simple  functions  will  be  based  on  the  follow¬ 
ing  theorem. 

Theorem  4.  In  order  that  the  function  fix )  he  n- measurable  it  is 
necessary  and  sufficient  that  it  be  representable  as  a  limit  of  a  uni¬ 
formly  convergent  sequence  of  simple  functions. 


Proof.  The  sufficiency  is  clear  from  Theorem  2.  To  show  the 
necessity,  let  us  consider  an  arbitrary  measurable  function /(t), 
and  let  us  set  f„(x)  =  m/n  if  m/n  <  f{x)  <  (to  +  l)/n  (here  to 
are  integers,  and  n  are  positive  integers).  It  is  clear  that  the 
functions  fn{x)  are  simple;  they  converge  uniformly  to  fix)  as 
n— >  since  \f(x)  -  fn(x)  \  <  1/n. 

Theorem  5.  The  sum  of  two  n-measurable  functions  is  n- meas¬ 
urable . 


Proof.  Let  us  first  show  this  assertion  for  simple  functions.  If 
f{x)  and  g(x)  are  two  simple  functions  taking  on  the  values 


and 


fhfi,  *••,/», 


Qh  •••  On, 


respectively,  then  their  sum  h(x)  =  fix)  +  g(x)  can  take  on 
only  the  values  h  =  /;  +  gt,  where  each  of  these  values  is  taken 
on  on  the  set 


\x:h(x)  =  h]  =  U  (\x:f(x)  = /,•}  f|  \x-.g{x)  =  g,-}).  (2) 

fi+Bj—h 

The  number  of  possible  values  h  is  finite  or  countable  and  the 
corresponding  sets  \x  :  h(x)  =  h }  are  measurable,  since  the 
right-hand  side  of  equation  (2)  is  obviously  a  measurable  set. 
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To  prove  the  theorem  for  arbitrary  measurable  functions /(t) 
and  g{x),  let  us  consider  sequences  of  simple  functions  {fn(x) }  and 
{</„(£)}  which  converge  to/(x)  and  g(x),  respectively.  Then  the 
simple  functions  fn(x)  +  g«(x)  converge  uniformly  to  the  func¬ 
tion /(t)  +  g{x),  which,  by  Theorem  4,  is  measurable. 

Theorem  6.  A  B-measurable  function  of  a  ^-measurable  function 
is  ix- measurable . 

Proof.  Let  f{x)  =  <f>[ i(x)~\,  where  xp  is  Borel  measurable  and  xp 
is  ^-measurable.  If  A  c  Dl  is  an  arbitrary  /^-measurable  set,  then 
its  inverse  image  A'  =  <p~'{A)  is  B-measurable,  and  the  inverse 
imaged"  =  xp~l(A')  of  theset  A'  is  /^-measurable.  Since /-'(A)  = 
A  ",  the  function  /  is  measurable. 

The  theorem  just  proved  is  applicable,  in  particular,  in  the 
case  of  continuous  functions  <p  (they  are  always  B-measurable). 

Theorem  7.  The  product  of  ix-measur able  functions  is  n-measurable . 

Proof.  Since  fg  =  \  [(/  +  gY  —  (/  —  sO2],  the  assertion  follows 
from  Theorems  5  and  6,  and  the  fact  that  <p(t)  -  t2  is  a  con¬ 
tinuous  function. 

Exercise.  Show  that  if  f(x)  is  measurable  and  does  not  take  on 
the  value  zero,  then  1  /f(x)  is  also  measurable. 

In  studying  measurable  functions  one  can  often  neglect  the 
values  of  a  function  on  a  set  of  measure  zero.  In  connection  with 
this  let  us  introduce  the  following 

Definition.  Two  functions  /  and  g,  defined  on  one  and  the  same 
measurable  set  E,  are  called  equivalent  (denoted  by/ ~  g),  if 

ix\x:f(x)  9^  g{x))  =  0. 

One  says  that  some  property  is  satisfied  almost  everywhere  on  E, 
if  it  is  satisfied  at  all  points  of  E  with  the  exception  of  points 
which  form  a  set  of  measure  zero.  Thus  we  can  say  that  two 
functions  are  equivalent  if  they  coincide  almost  everywhere. 

Theorem  8.  If  two  functions  f  and  g  which  are  continuous  on 
some  segment  E  are  equivalent,  then  they  coincide. 
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Proof.  Let  us  assume  that,  at  some  point  x0,f(x 0)  ^  g(x 0),  i.e., 
/(To)  —  g(x0)  5^  0.  Since/  —  g  is  a  continuous  function,  one  can 
find  a  neighbourhood  of  the  point  t0,  at  all  points  of  which  the 
function  /  —  g  is  different  from  zero.  This  neighbourhood  has  a 
positive  measure;  thus 

^  g{x) }  >  0, 

i.e.,  the  continuous  functions  /  and  g  cannot  be  equivalent  if  they 
take  on  different  values  at  at  least  one  point. 

It  is  obvious  that  the  equivalence  of  two  arbitrary  measurable 
functions  (i.e.,  in  general,  discontinuous)  does  not  at  all  imply 
their  identity;  for  example,  the  function  which  is  equal  to  unity 
at  rational  points  and  equal  to  zero  at  irrational  points  is  equiva¬ 
lent  to  the  function  which  is  identically  zero. 

Theorem  9.  The  function  f(x)  which  is  defined  on  some  measurable 
set  E  and  is  equivalent  on  this  set  to  some  measurable  function  g(x) 
is  also  measurable. 

Indeed,  from  the  definition  of  equivalence  it  follows  that 
{ x:f(x)  >  a)  and  [x\g(x)  >  a } 

can  differ  from  each  other  only  by  a  set  of  measure  zero;  hence  if 
the  second  is  measurable,  so  is  the  first. 

The  definition  of  a  measurable  function  given  above  is  quite  formal. 
In  1913,  N.  N.  Luzin  proved  the  following  theorem  which  shows  that 
measurable  functions  are  functions  which,  in  a  well  known  sense,  can  be 
approximated  by  continuous  functions. 

Luzin's  Theorem.  In  order  that  the  function  fix)  be  measurable  on  the 
segment  [a,  6]  it  is  necessary  and  sufficient  that  for  any  «  >  0  there  exist  a 
function  <p(x)  which  is  continuous  on  [a,  6],  and  such  that 

m{t:/(x)  ^  <p(x) }  <  s. 

In  other  words,  a  measurable  function  can  be  made  continuous,  if  one 
excludes  from  consideration  its  values  on  a  set  of  arbitrarily  small 
measure.  This  property  which  Luzin  called  the  C-property  can  be  taken 
as  a  definition  of  a  measurable  function. 
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9.  Sequences  of  Measurable  Functions. 

Different  Types  of  Convergence. 

Theorems  5  and  7  of  the  preceding  section  show  that  arithmetic 
operations  on  measurable  functions  again  lead  to  measurable 
functions.  According  to  Theorem  2  of  §8,  the  class  of  measurable 
functions,  in  contrast  to  the  set  of  continuous  functions,  is  also 
closed  with  respect  to  the  operation  of  going  to  the  limit.  For 
measurable  functions  it  is  expedient  to  introduce,  aside  from  the 
usual  convergence  at  every  point,  several  other  definitions  of 
convergence.  These  definitions  of  convergence,  their  basic  proper¬ 
ties  and  the  connections  between  them  will  be  investigated  in  the 
present  section. 

Definition  1.  The  sequence  of  functions  f„(x),  defined  on  some 
space  with  measure  X,  is  said  to  converge  almost  everywhere  to  the 
function  F(x),  if 

lim  fn(x)  =  F(x )  (1) 

n-*oo 

for  almost  all  x  €  X  (i.e.,  the  set  of  those  points  at  which  (1) 
does  not  hold  has  measure  zero). 

Exam-pie.  The  sequence  of  functions  fn{x)  =  ( —x)n,  defined 
on  the  segment  [0, 1],  converges  as  n  —>  =°  to  the  function  F(x)  = 
0  almost  everywhere  (precisely,  everywhere  with  the  exception  of 
the  point  x  =  1). 

Theorem  2  of  §8  admits  of  the  following  generalisation. 

Theorem  1.  If  the  sequence  of  measurable  functions  fn(x)  con¬ 
verges  to  the  function  F(x)  almost  everywhere,  then  F(x)  is  also 
measurable. 

Proof.  Let  A  be  that  set  on  which 

lim  /„  (x)  =  F(x). 

n-+  co 

By  the  condition,  n(E\A )  =  0.  The  function  F(x)  is  measurable 
on  A  by  Theorem  2,  §8.  Since  on  a  set  of  measure  zero  every 
function  is  obviously  measurable,  F(x)  is  measurable  on  E\A; 
therefore,  it  is  also  measurable  on  the  set  E. 
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Exercise.  Let  the  sequence  of  measurable  functions  f„(x)  con¬ 
verge  almost  everywhere  to  some  limit  function  fix).  Show  that 
the  sequence  f„(x)  converges  to  g(x)  if  and  only  if  g(x)  is  equiva¬ 
lent  to  f(x). 

The  following  important  theorem,  proved  by  D.  F.  Egorov, 
establishes  the  connection  between  the  concept  of  convergence 
almost  everywhere  and  uniform  convergence. 

Theorem  2.  Let  the  sequence  of  measurable  functions  fn(x)  con¬ 
verge  on  E  almost  everywhere  to  f(x).  Then  there  exists  for  any 
S  >  0  a  measurable  set  Es  C  E  such  that 

1)  ix(Es)  >  n(E)  -  S, 

2)  on  the  set  Es  the  sequence  fn(x)  converges  uniformly  tof(x). 
Proof.  By  Theorem  1,  the  function /(z)  is  measurable.  Set 

K  =  n  I* :  !/.-(*)  -  /Or)  I  <  -l 

i^n  \  Ytl) 

Thus  F",  for  fixed  m  and  n,  denotes  the  set  of  those  points  x 
for  which 

\fi{x)  -  f(x)  I  <  - 
m 

for  all  i  >  n.  Let 

E”  =  U  K- 

From  the  definition  of  the  set  E™  it  is  clear  that  for  fixed  m, 

E?  c  £7  c  ...  c  c  ..., 

therefore,  because  of  the  fact  that  the  o--additive  measure  is 
continuous,  one  can  find  for  any  m  and  any  6  >  0  an  n((m) 
such  that 


E  5  f]  (m) 

m 


Let  us  set 
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and  show  that  the  Es  so  constructed  satisfies  the  requirements  of 
the  theorem. 

Let  us  first  prove  that  on  Es  the  sequence  \fdx)  |  converges 
uniformly  to  the  function  f(x).  This  follows  immediately  from 
the  fact  that  if  x  £  Es ,  then  for  any  m 

| fi(x)  —  f(x)  |  <—  for  i  > 
m 

Let  us  now  evaluate  the  measure  of  the  set  E\ES.  For  this  let  us 
note  that  for  every  m,  fi(E\Em )  =  0.  Indeed,  if  xo  £  E\Em, 
then  there  exist  arbitrarily  large  values  of  i  for  which 

\fi(xo)  -  f(x o)  j  >  — , 
m 

i.e.,  the  sequence  Ifn(x') }  does  not  converge  to  f(x)  at  the  point 
x0.  Since  by  assumption  [f„(x)\  converges  to  f(x)  almost  every¬ 
where, 

n(E\Em)  =  0. 

This  implies 

r(E\Ez(m))  =  n(E”\E% ;<«>)  <  ^ 

Hence 

n(E\Es)  =  M(F\n  E”{m))  =  m(U  (E\EZm)) 

<  ZM(E\EZ(m))  <  =  s. 

m  m= 1  ^ 

The  theorem  is  thus  proved. 

Definition  2.  One  says  that  the  sequence  of  measurable  functions  /„ (x) 
converges  in  measure  to  the  function  F{x),  if  for  any  <r  >  0 

lim  n\x:  \f„(x)  —  F(x)  j  >  a-}  =0. 

71 -►CO 

Theorems  3  and  4  given  below  establish  the  connection  between  con¬ 
vergence  almost  everywhere  and  convergence  in  measure. 
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Theorem  3.  If  a  sequence  of  measurable  functions  fn(x)  converges  almost 
everywhere  to  some  function  Fix),  then  it  converges  to  the  same  limit  function 
F{x)  in  measure. 

Proof.  Theorem  1  implies  that  the  limit  function  F(x)  is  measurable. 
Let  A  be  that  set  (of  measure  zero)  on  which  /„  (x)  does  not  converge  to 
Fix).  Let,  moreover, 

EM  =  {*:|/*(*)  -  F(x)  |  >  v},  Rn(t r)  =  0  EM, 

k=n 


M  =  f|  Rn(t t). 

71=1 


It  is  clear  that  all  these  sets  are  measurable.  Since 

RM  D  EM  D  •••, 

we  have,  because  the  measure  is  continuous, 

n(R„(<r))  -+ n(M)  for  n— ►«>. 
Let  us  now  check  that 

M  C  A. 


Indeed,  if  x0  (£  Ax,  i.e.,  if 

lim/„(z0)  =  F(x  0), 


then,  for  a  given  <r  >  0,  we  can  find  an  n  such  that 
|/„(z0)  -  F(x o)  I  <  <T, 
i.e.,  Xo  ^  E„(<r )  and  hence  xQ  ^  M. 

But  tx(A)  =  0,  therefore  (2)  implies  that  y(M)  =  0  and  hence 
tx(Rn(( t))  — >  0  for  n—»  oo, 


(2) 


since  En((f)  C  Rn(<r).  The  theorem  is  proved. 

It  is  not  difficult  to  convince  oneself  by  an  example  that  the  con¬ 
vergence  in  measure  of  a  sequence  of  functions  does  not  imply  con¬ 
vergence  almost  everywhere.  Indeed,  let  us  define  for  each  natural  k 
on  the  open  segment  (0,  1]  k  functions 

/.<*>, Uk\  ••  •/*<*> 
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by  the  following  method : 

ft\x)  = 


1  for  —  <  x  <  j, 
k  k 


[O  for  all  other  values  of  x. 


Numbering  all  these  functions  in  order,  we  obtain  a  sequence  which, 
as  is  easy  to  check,  converges  in  measure  to  zero,  but  at  the  same  time 
does  not  converge  at  a  single  point  (prove  this!). 

Exercise.  Let  the  sequence  of  measurable  functions  f„(x)  converge  in 
measure  to  some  limit  function  f(x).  Show  that  the  sequence  will  converge 
in  measure  to  the  function  g(x)  if  and  only  if  g(x)  is  equivalent  to  f(x). 
Even  though  the  example  given  above  shows  that  Theorem  3  cannot 
be  completely  reversed,  the  following  theorem  holds : 

Theorem  4.  Let  the  sequence  of  measurable  functions  fn(x)  converge  in 
measure  to  f(x).  Then  one  can  select  from  the  sequence  \f„(x) }  a  subsequence 
{ f„k(x) )  which  converges  to  f{x)  almost  everywhere. 

Proof.  Let  e,,  s2,  •  •  •  be  some  sequence  of  positive  numbers  for  which 


lim  s„  =  0, 

n-*oo 


and  let  the  positive  numbers  vi,  m,  ‘  V*>  be  such  that 

Vi  +  V2  +  •  *  • 

converges.  Let  us  construct  a  sequence  of  indices 

Ul  <  U2  <  •  •  • 

in  the  following  way :  ni  is  a  natural  number  such  that 

|/»i(*)  -  /(*)  I  >  si)  <  Vi 

(such  an  n,  certainly  exists) ;  further,  let  n2  be  a  number  such  that 

n{x:  | f„2(x)  -  f(x)  |  >  e2j  <  772,  n2  >  nh 

In  general,  let  n*  be  a  number  such  that 

ix{x:  \fnk{x)  -f{x)  \  >  e*}  <  Vk,  nk  > 

We  shall  show  that  the  sequence  we  have  constructed  converges  to 
f(x)  almost  everywhere.  Indeed,  let 

^1=0  \x:  I /.,(*)  -  f(x)  I  >  eA},  Q  =  f\  R{. 

k=i  i=l 
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Since 


RiD  R2D  RsD  •  •  ■.  D  Rn  D 


the  continuity  of  the  measure  implies  n(Ri)  — >  n(Q). 


00 

On  the  other  hand,  it  is  clear  that  fi(Ri)  <  ^  which  yields 
p(R,)  — » 0  for  i  — *  <® .  Since  — *  0,  k=i 


fi(Q)  =  0. 

It  remains  to  show  that  at  all  points  of  the  set  E\Q,  the  relation 

Ut(x)  —*f(x) 

holds.  Let  xa  6  E\Q.  Then  one  can  find  an  i0  such  that  x0  (£  This 
means  that  for  all  k  >  i0 

xo  £  \x:\fnk(x)  —  f(x)  |  >  zk\, 


i.e., 


|/»*(zo)  ~  f(x 0)  |  <  s*. 
Since,  by  assumption,  st  — >  0, 

lim  fnk(xo)  =f(x  0). 


n-*oo 


The  theorem  is  proved. 


CHAPTER  III 


THE  LEBESGUE  INTEGRAL 


In  the  preceding  chapter  we  investigated  the  basic  properties 
of  measurable  functions,  which  are  a  quite  wide  generalisation  of 
continuous  functions.  For  measurable  functions,  the  classical 
definition  of  an  integral  known  from  analysis  and  usually  called 
the  Riemann  integral  is  generally  not  applicable.  For  example, 
the  well  known  Dirichlet  function  which  equals  zero  at  irrational 
points  is  obviously  measurable,  but  is  not  integrable  in  the  sense 
of  Riemann.  Thus  this  concept  of  an  integral  turns  out  to  be  of 
little  use  with  respect  to  measurable  functions. 

The  reason  for  this  is  clear.  Let  us  assume,  for  simplicity,  that 
we  are  considering  the  functions  on  a  segment.  Introducing  the 
concept  of  the  Riemann  integral,  we  split  the  segment  on  which 
the  function  f(x)  is  given  into  small  segments,  and  taking  in 
each  of  these  parts  an  arbitrary  point  $*,  we  form  the  sum 

k 

Essentially,  we  replace  here  the  value  of  the  function  /( x )  at 
every  point  of  the  segment  Axk  =  £*+,]  by  its  value  at  some 

arbitrarily  selected  point  £*  of  this  interval.  However,  it  is  only 
natural  to  do  this  if  the  values  of  the  function  f(x)  at  neigh¬ 
bouring  points  are  close  to  each  other,  i.e.,  if  f(x)  is  Continuous 
or  if  the  set  of  its  points  of  discontinuity  is  “not  too  large”.* 

The  basic  idea  of  the  Lebesgue  integral  consists  of  the  fact  that, 
as  opposed  to  the  Riemann  integral,  the  points  x  are  grouped  not 
by  their  closeness  on  the  z-axis,  but  by  the  closeness  of  the  values 
of  the  functions  at  these  points.  This  immediately  gives  rise  to 


*  As  is  well  known,  a  bounded  function  is  integrable  in  the  sense  of  Riemann  if  and 
only  if  the  set  of  its  points  of  discontinuity  has  measure  zero. 
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the  possibility  of  generalising  the  concept  of  the  integral  to  a 
quite  wide  class  of  functions. 

Moreover,  the  Lebesgue  integral  is  defined  in  exactly  the  same 
way  for  functions  which  are  defined  in  any  spaces  with  measures, 
whereas  the  Riemann  integral  is  first  introduced  for  functions  of 
one  variable,  and  only  then  is  it  carried  over  with  corresponding 
changes  to  the  case  of  several  variables. 

Everywhere,  where  the  contrary  is  not  especially  stated,  we 
shall  consider  some  <r-additive  measure  n(A)  which  is  defined  on  a 
Borel  algebra  of  sets  with  a  unit  X.  All  the  sets  A  c  X  con¬ 
sidered  will  be  assumed  to  be  <r-measurable,  and  the  functions 
f(x)  to  be  defined  for  x  €  X  and  ^-measurable. 


10.  The  Lebesgue  Integral  for  Simple  Functions 

We  shall  first  introduce  the  concept  of  a  Lebesgue  integral  for 
functions  which  we  called  simple  in  the  preceding  section,  i.e.,  for 
measurable  functions  which  take  on  a  finite  or  countable  number 
of  values. 

Let  f(x)  be  some  simple  function  which  takes  on  the  values 

2/i,  2/2,  *  ”,  2/»,  *  •  *,  Vi  9*  yi  for  i  ^  j. 

It  is  natural  to  define  the  integral  of  the  function /(t)  over  the 
set  A  by  the  equation 

f  f(x)  dfx  =  {x\x  e  A,  f(x)  =  y„}.  (1) 

J  A  n 

Thus  we  arrive  at  the  following  definition. 

Definition.  The  simple  function /(t)  is  called  integrable  (with  re¬ 
spect  to  the  measure  n )  over  the  set  A  if  the  sequence  (1 )  con¬ 
verges  absolutely.  If  f(x)  is  integrable,  then  the  sum  of  the 
series  (1)  is  called  the  integral  of  f(x)  over  the  set  A. 

In  this  definition  we  assume  that  all  the  yn  are  different.  One 
can,  however,  represent  the  value  of  the  integral  of  a  simple 
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function  as  a  sum  of  products  of  the  form  ckn(Bk)  and  not  assume 
that  all  the  ck  are  different.  The  following  lemma  allows  us  to  do 
this: 

Lemma.  Let  A  =  U  Bk,  B(  n  B}  =  0  for  i  ^  j,  and  assume  that 

k 

on  each  set  Bk  the  function  f(x)  takes  on  only  one  value  ck.  Then 
f  f(x)  d»  =  Z  etn(Bk) ;  (2) 

J  A  k 

moreover,  the  function  f(x)  is  integrable  over  A  if  and  only  if  the 
series  (2)  converges  absolutely. 

Proof.  It  is  easy  to  see  that  every  set 

An  =  { x:x  6  A,  f{x)  =  yn\ 

is  the  union  of  those  Bk  for  which  ck  =  yn.  Therefore 

=  Z</n  E  n(Bt)  =  Ec*m(£*). 

»  n  ck=yn  k 

Since  the  measure  is  non-negative, 

ZIv.ImM.)  =  £|y„|  E  m(&)  = 

n  n  ck=Vn  k 

i.e.,  the  series  and  t)  both  either  converge 

n  k 

absolutely  or  diverge. 

Let  us  establish  some  properties  of  the  Lebesgue  integral  for 
simple  functions: 

A)  f  f(x)  dfi  +  f  g{x)  dn  =  [  {  f(x)  +  g{x) } 

moreover,  from  the  existence  of  the  integrals  on  the  left-hand 
side  it  follows  that  the  integrals  on  the  right-hand  side  exist. 

To  prove  this  let  us  assume  that /(a:)  takes  on  the  values  /<  on 
the  sets  c  A,  and  g(x)  the  values  gi  on  the  sets  G;  c  A,  since 


Ji  =  [  fix)  rfM  =  Efr(L'), 

*  A  i 

(3) 

J2  =  I  g(x)  dn  = 

J  A  i 

(4) 
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Then,  by  the  lemma, 

J  =  I  { fix)  +  g(x)  1  d»  =  Z  Zifi  +  9l)niF t  n  <?,) ;  (5) 

J  A  i  i 

but 

m(^)  =  Z^n(?,), 

J 

nidi)  =  Zdtf.ne,). 

i 

Hence,  from  the  absolute  convergence  of  the  series  (3)  and  (4), 
there  follows  the  absolute  convergence  of  the  series  (5);  here 

J  =  Ji  “b  Ji. 

B)  For  any  constant  k, 

k  f  f(x )  dfi  =  f  { kf(x ) }  dn ; 

J  A  J  A 

moreover,  the  existence  of  the  integral  on  the  left-hand  side  implies 
the  existence  of  the  integral  on  the  right.  (This  can  be  shown 
immediately. ) 

C)  A  simple  function /(t)  which  is  bounded  on  the  set  A  is 
integrable  over  A ;  moreover,  if  \f(x)  \  <  M  on  A,  then 

fix )  dn  <  MuiA). 

(This  can  be  shown  immediately. ) 

11.  General  Definition  and  Basic  Properties  of  the 
Lebesgue  Integral 

Definition.  We  shall  say  that  the  function  fix)  is  integrable  over 
the  set  A,  if  there  exists  a  sequence  of  simple  functions  f„(x) 
which  are  integrable  over  A  and  converge  uniformly  to  fix).  We 
shall  denote  the  limit 

J  =  lim  f  f„ix)  dn 

A 


n~*  co 


(1) 
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/  fix)  d/I 

J  A 

and  call  it  the  integral  of  the  function  f(x)  over  the  set  A. 

This  definition  is  correct  if  the  following  conditions  are  satisfied: 

1.  The  limit  (1)  for  any  uniformly  convergent  sequence  of 
simple  functions  which  are  integrable  over  A  exists. 

2.  This  limit  for  a  given  function /(t)  does  not  depend  on  the 
choice  of  the  sequence  { fn(x)}. 

3.  For  simple  functions  the  definitions  of  integrability  and 
integral  are  equivalent  to  those  given  in  §10. 

All  these  conditions  are  indeed  satisfied. 

To  prove  the  first  it  suffices  to  note  that  by  properties  A), 
B)  and  C)  for  integrals  of  simple  functions, 


j  fn ix)  d/x  I  f mix)  d/i 

"  4  J  4 


<  ft  (A)  sup  |  f„(x)  -  fm(x) 

xeA 


To  prove  the  second  condition,  we  must  consider  the  two 
sequences  ifn(x) }  and  {/„*(*) ),  and  use  the  fact  that 


J  f„(x)  d/i  -  J  fn*ix) 


dfi 


<  niA) 


sup  I fnix)  -  fix)  I  +  sup  I f«*(x)  -  fix) 

xtA  xtA 


Finally,  to  prove  the  third  condition  it  suffices  to  consider  the 
sequence  /„(  x )  =  /( x ) . 

Let  us  establish  the  basic  properties  of  the  Lebesgue  integral. 

Theorem  1 . 

f  1-d/i  =  /i  (A). 

J  A 


Proof.  It  follows  immediately  from  the  definition  of  the  integral. 
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Theorem  2.  For  any  constant  k, 

k  f  fix)  dp  =  f  { kf(x)}  dn, 

J  A  •'A 

where  the  existence  of  the  integral  on  the  left-hand  side  implies  the 
existence  of  the  integral  on  the  right. 

Proof.  The  proof  is  obtained  from  property  B)  by  proceeding 
to  the  limit  for  an  integral  of  simple  functions. 

Theorem  3. 

[  fix)  d/x  +  f  g(x)  dn  =  f  {fix)  +  gix)]  dn, 

J  a  *  A  ’'A 

where  the  existence  of  the  integral  on  the  left  implies  the  existence  of 
the  integral  on  the  right. 

Proof.  The  proof  is  obtained  from  property  A)  by  proceeding 
to  the  limit  for  an  integral  of  simple  functions. 

Theorem  4.  A  function  fix)  which  is  bounded  on  the  set  A  is  integra- 
ble  over  A. 

Proof.  The  proof  is  obtained  from  property  C)  by  proceeding 
to  the  limit  for  an  integral  of  simple  functions. 

Theorem  5.  If  fix)  >  0,  then 

f  fix)  d»>  0 

J  A 

(i assuming  that  the  integral  exists). 

Proof.  For  simple  functions  this  follows  immediately  from  the 
definition;  for  the  general  case  the  proof  is  based  on  the  possi¬ 
bility  of  approximating  non-negative  functions  by  non-negative 
simple  functions  (for  example  by  the  method,  given  in  the  proof 
of  Theorem  4  of  §9). 

Corollary  1.  If  fix)  >  gix),  then 

/  fix)  d/x>  gix)  dfx. 

J  A  J  A 
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Corollary  2.  If  on  A,  m  <  fix)  <  M,  then 

mp(A)  <  [  f{x)  dy  <  Mp(A). 

J  A 

Theorem  6.  If 

A  =  U  An,  Ai  n  Aj  =  0  for  i  0  j, 

n 

then 

[  fix)  dy=Z[  fix)  dy, 

J  A  n  JAn 

moreover,  the  existence  of  the  integral  on  the  left-hand  side  implies 
the  existence  of  the  integrals  and  the  absolute  convergence  of  the  series 
on  the  right. 

Proof.  Let  us  first  check  the  assertion  of  the  theorem  for  a  simple 
function /(a;),  which  takes  on  the  values 


Let 


Vh  2/2,  •  •  •  2 h, 


Bk  =  {x:x  €  A,  f{x)  =  yk), 


B„k  =  \x:x  €  An,  fix)  =  yk}; 

then 

[  fix)  dy  =  ZvkuiBk)  =  Zltk  X) /*(#»*) 

J  A  k  k  n 

=  ZZl ikniBnk)  =  X  /  fix)  dy.  (1) 

n  k  n  J  An 

Since,  under  the  assumption  of  integrability  of  fix)  over  A,  the 
series  Z  VkniBk)  converges  absolutely,  and  the  measure  is  non- 

k 

negative,  all  the  other  series  of  the  chain  of  equations  (1 )  also 
converge  absolutely. 

In  the  case  of  an  arbitrary  function  fix),  its  integrability  over 
A  implies  that  for  any  e  >  0  there  exists  a  simple  function  g(x) 
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which  is  integrable  over  A  and  satisfies  the  condition 


For  g(x), 


I  fix)  -  g(x)  |  <  c. 


Z  /  g(x)  <V, 

n  J  An 


(2) 

(3) 


moreover,  g(x)  is  integrable  over  every  set  An  and  the  series  (3) 
converges  absolutely.  This  last  fact  and  the  estimate  (2)  imply 
that/fz)  is  also  integrable  over  every  A„,  and 


/  f(x)  dn  -  g(x)  d^ t 

J A _  "A- 


<  Z  ZH(An)  <  6 l*(A), 


/  f(x)  dn  -  g{x)  dfi 

J  A  J  A 


<  e»(A), 


which  together  with  (3)  yields  the  absolute  convergence  of  the 


series  Z  /  f(x)(b  and  the  estimate 

J  A 

Z  /  fix)  dp  -  f  f(x)  d» 

n  j  A  _  J  A 


n  J  A 

Since  e  >  0  is  arbitrary, 


<  2s. 


Z  f  fix)  dn  =  [  fix)  dfi. 

n  J An  JA 

Corollary.  If  fix)  is  integrable  over  A,  then  fix)  is  also  integrable 
over  any  A'  c  A. 

Theorem  7.  If  the  function  <pix)  is  integrable  over  A  and  \  fix)  \  < 
(p(x),  then  fix)  is  also  integrable  over  A. 

Proof.  If  fix)  and  (fix)  are  simple  functions,  then  A  can  be 
represented  as  a  union  of  a  finite  or  countable  number  of  sets,  on 
each  of  which  fix)  and  <pix)  are  constant: 


fix)  =  a„,  <p ix)  =  a„,  where  |a„|  <  a, 
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Since  <p(x)  is  integrable,  we  have 

^|a„|M(4„)  <  22  a„n(An)  =  /  <p{x)  dn. 

n  n  *  A 

Therefore /(t)  is  also  integrable  and 

f  f(x)  dn  =  22  <  £|a„|yu(4n)  =  f  \f(x)\dn. 

J  A  n  n  J  A 

For  the  general  case  the  theorem  is  proved  by  proceeding  to 
the  limit. 

Theorem  8.  The  integrals 

Ji  =  f  f{x)  dn,  J2  =  [  \f(x)\dn 

J  A  J  A 

exist  or  do  not  exist  simultaneously. 

Proof.  From  the  existence  of  the  integral  J2  there  follows  the 
existence  of  the  integral  J i  by  Theorem  7. 

The  converse  follows,  in  the  case  of  a  simple  function,  from  the 
definition  of  the  integral,  and  in  the  general  case  it  is  proved  by 
going  to  the  limit  and  using  the  fact  that  always 

II® I  -  l&ll  <  I®  -  &l- 

Theorem  9.  (Tehebichev  Inequality)  If  <p(x)  >  0  on  A,  then 


/  < p(x)  dn=  <p(x)  dp  +  /  <p(x)  dn  >  I  <p{x)  dy.  >  cn(A') . 

Ja  JA>  A\Af  •>  A’ 
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Corollary.  If 


f  \f{x)  \dii  =  0, 

J  A 


thenf(x)  =  0  almost  everywhere. 

Indeed,  by  the  Tchebichev  inequality,  we  have 


n\x:x  6  A,  | f(x)  |  > 


<  n  f  | f(x) 

J  A 


dn  =  0 


for  all  n.  Therefore 


n{x-.x  £  A,  f(x)  ^  0)  <  2  v\x:x  €  A,  \f(x)  \  >  - }  =0. 

n 


12.  Limiting  Processes  Under  the  Lebesgue  Integral  Sign 

The  problem  of  proceeding  to  the  limit  under  the  integral  sign, 
or,  what  is  the  same  thing,  of  the  possibility  of  term  by  term 
integration  of  a  convergent  series,  is  often  encountered  in  various 
problems. 

It  was  established  in  classical  analysis  that  a  sufficient  condition 
for  the  possibility  of  such  a  limiting  process  is  the  uniform  con¬ 
vergence  of  the  corresponding  series. 

In  this  section  we  shall  derive  some  theorems  concerning  limit¬ 
ing  processes  under  the  Lebesgue  integral  sign,  which  represent 
quite  far  reaching  generalisations  of  the  corresponding  theorems 
of  classical  analysis. 

Theorem  1.  If  the  sequence  fn(x)  on  A  converges  to  f(x)  and  if  for 
all  n 

\fn(x)  |  <  <p{x), 

where  <p(x)  is  integrable  on  A,  then  the  limit  function  f  (x)  is  integ- 
rable  on  A  and 

J  fn(x)  dn^>  f  f(x)  dix. 
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Proof.  From  the  conditions  of  the  theorem  it  easily  follows  that 
| f(x)  I  <  tp(x).  Let 

Ak  =  {x:k  —  1  <  <p(x)  <  7c } ; 


and  also  let 

Bm  =  U  Ak  =  {x: <p(x)  >  m\. 

k—m 

By  Theorem  6  of  the  preceding  section 

/  tfi(x)  dn=J2  <p(x)  dn, 

J  A  k 

and  the  series  ( *  )  converges  absolutely. 

Here 


(*) 


H  /  <p(x)  dji. 

k  —  m  *  Afe 


From  the  convergence  of  the  series  (*  )  there  follows  the  existence 
of  an  to  such  that 

/  <p{x)  dn  <  A 

jb„  5 

The  inequality  <p(x)  <  to  holds  on  A  \Bm.  By  Egorov’s  theorem, 
A\Bm  can  be  represented  in  the  form  A\Bm  =  C  u  D,  where 
n{D)  <  s/5  to  and  on  the  set  C  the  sequence  {  /„}  converges  uni¬ 
formly  to  /. 

Let  us  select  an  N  such  that,  for  n  >  N,  on  the  set  C 

[/.M  -/Ml  <^. 

Then 


f  ( fn{x)  -  f(x))  dn  =  f  }n{x)  dfi  —  [  f{x)  dfi 
JA  JB„  ->Bm 

+  f  fn(x)  dfi  —  f  f(x)  dn  +  f  (fn(x)  -  f{x))  dn  <  =  s. 

JD  JD  O 
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Corollary.  If  \fn(x)\  <  M  and f n(x)  — > / (x) ,  then 
/  fn(x )  dn->  fix)  dp. 

JA  J  A 

Remark.  In  as  much  as  the  values  which  the  function  takes  on 
on  the  set  of  measure  zero  do  not  influence  the  value  of  the 
integral,  it  suffices  to  assume  in  Theorem  1  that  [fn(x)\  con¬ 
verges  to  f(x)  almost  everywhere. 

Theorem  2.  Let,  on  the  set  A, 

fl(x)  <  MX)  <  •  •  <  fn(x)  <  •", 

where  the  functions  fn(x)  are  integrable  and  their  integrals  are  not 
greater  than  a  certain  constant: 


f  }n{x)  dti  <  K. 

J  A 


(1) 


Then  almost  everywhere  on  A  the  limit 

f(x)  =  lim/„U) 

n-*  cd 

exists,  the  function  f(x )  is  integrable  on  A,  and 
/  fn(x)  dfi  -*  /  f{x)  dn . 

•'A  •'A 

On  a  set  on  which  the  limit  (1 )  does  not  exist,  the  function 
f(x)  can  be  given  arbitrarily,  for  example  assuming  on  this  set 
fix)  =  0. 

Proof.  Let  us  assume  /i(x)  >  0,  since  the  general  case  can  be 
easily  reduced  to  this  by  going  over  to  the  functions 

fn(x)  =  fn(x)  -fl{x). 

Let  us  consider  the  set 

12  =  {x:x  G  A,fn{x)  — >  oo  }. 

It  is  easy  to  see  that  12  =  fl  U  12n(r),  where 


12„w  =  {x:x  £  A,fn(x)  >  r\. 
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By  Tchebischev’s  inequalities  (Theorem  9,  §11), 

M(9«(r))  <  -  - 
r  , 

Since 

Qi«  c  Q,w  c  ...  c  o„w  c 

we  have  n  (  U  0„(r)  j  <  if/r,  but  from  the  fact  that  for  any  r 

'  n  ' 

sicU  fi„w, 

n 

it  follows  that  /*  (9 )  <  K/r.  Since  r  is  arbitrary 

m(9)  =  0. 

This  proves  that  the  monotone  sequence  fn(x)  has  a  finite  limit 
almost  everywhere  on  A. 

Let  us  now  set  <p(x)  =  r  for  those  x  for  which 

r  —  1  <  f(x)  <  r,  r  -  1,  2,  •  •  • . 

If  one  can  show  that  <p(x )  is  integrable  on  A,  then  the  assertion 
of  our  theorem  will  be  an  immediate  consequence  of  Theorem  1. 

Let  us  denote  by  Ar  the  set  of  those  points  x  £  A  for  which 
<p(x)  =  r  and  let  us  set 

B,  =  U  Ar. 

r= 1 

Since  on'jBs  the  functions  fn{x)  and  f(x)  are  bounded,  and 
since,  always  <p{x)  <  f(x)  +  1, 

f  <p(x)  dn  <  f  f(x)  dn  +  n(A) 

JB, 


—  lim  f  }n{x)  dft  +  n(A)  <  K  +  n(A). 

n~* oo  *  B  s 


But 


I  <p(x)  dfi  =  £  rii(Ar). 

J  R.  r=1 
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However,  the  boundedness  of  these  sums  means  that  the  series 

£  rti(Ar)  =  j  <p(x)  dp 

r=l  J  A 

converges.  Thus,  it  is  proved  that  <p(x)  is  integrable  on  A. 
Corollary.  If  ipn(x)  >  0  and 


in{x)  dn  < 


oo 

the  series  lAx)  converges  almost  everywhere  on  A  and 

n=l 


f  (jl  'l'n(x))  dp  =  £  f  4<n(x)  dp. 

J A  \n=- 1  /  n=l  JA 

Theorem  3.  (Fatou)  If  the  sequence  of  measurable  non-negative 
functions  {/„(*)}  converges  almost  everywhere  on  A  tof(x)  and 

f  fn  (x)  dp  <  K, 

JA 

then  /( x )  is  integrable  on  A  and 

f  f{x)  dp  <  K. 

•'A 

Proof.  Let  us  set 

<pn(x)  =  inf  fk(x); 

k^n 

<pn(x)  is  measurable  because 

\x:tpn(x)  <  c(  =  U  \x:fk{x)  <  c). 

k^n 

Moreover,  0  <  <pn(x)  <  fn{x),  hence  <pn(x)  is  integrable  and 
/  <p„(x)  dp  <  fn(x)  dp.  <  K. 

J  A  J  A 
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Finally, 

and 


<Pl(x)  <  <p2(x)  <  •••  <  <Pn(x)  <  ■, 


lim  <pn(x)  =  f{x) 

n-*co 


almost  everywhere.  Therefore,  applying  the  preceding  result  to 
{<pn(x)\,  we  obtain  the  required  result. 


Theorem  A.  If  A  =  U  An,  A,  D  A  r  =  0  for  i  0  j  and  the  series 

n 


E  f  \f(x)\dn  (2) 

n  JA„ 

converges,  then  the  function  f(x)  is  integrable  on  A  and 
f  }{x)  dfi  =  E  f  fix)  dfi. 

JA  n  JAn 

New  here,  as  compared  to  Theorem  6  of  §11,  is  the  assertion 
that  from  the  convergence  of  the  series  (2)  it  follows  that  fix) 
is  integrable  on  A. 

Let  us  first  give  the  proof  for  the  case  of  a  simple  function 
which  takes  on  the  values  /,•  on  the  sets  B(.  Setting 


we  have 


Ani  —  A  n  D  B  i, 


\  \f(x)\d»  =  El/.'UUnd. 

j  A  ..  i 


From  the  convergence  of  the  series  (2)  there  follows  the  con¬ 
vergence  of  the  series 


E  EI/,|M(dni)  =  EI/iU(fli  n  A). 

n  i  i 

The  convergence  of  the  last  series  means  that  the  integral 


f  <p{x)  dn  =  E/.-M(B.-n  A) 

J  A  4 


exists. 
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In  the  general  case  we  approximate  the  function /(z)  by  the 
function  /(x)  in  such  a  way  that 

I/O)  -  /O)  I  <  S-  (3) 

Then 


f  I/O)  \  dn  <  f  \f(x)\dv.  +  sn(An), 

J  4  _  4 


and,  since  the  series 

X)  M(d„)  =  /i(d) 


converges,  the  convergence  of  the  series  (2)  implies  the  con¬ 
vergence  of  the  series 


E  f  I  /O)  I  d/x, 

n  JA„ 

i.e.,  by  what  has  been  just  proved,  the  integrability  over  A  of  the 
simple  function /(t).  But  then,  by  (3),  the  initial  function /(x) 
is  also  integrable  over  A. 


1 3.  Comparison  of  the  Lebesgue  Integral  and  the  Riemann  Integral 


Let  us  clarify  the  relation  between  the  Lebesgue  integral  and 
the  usual  Riemann  integral.  Here  we  shall  limit  ourselves  to  the 
simplest  case  of  a  linear  Lebesgue  measure  on  a  line. 


Theorem.  If  the  Riemann  integral 


J  = 


dx, 


exists,  then  f{x)  is  Lebesgue  integrable  on  [a,  5]  and 


=  J. 
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Proof.  Let  us  consider  the  subdivision  of  [a,  6]  into  2"  parts  by 
the  points 

k 

xk  =  a  +  — (b  —  a) 

2" 

and  the  Darboux  sums,  which  correspond  to  this  subdivision : 


S 


n 


b  -  a  * 

*=i 


where  Mnk  is  the  upper  bound  of  f(x)  on  the  segment 

Xk- 1  <  X  <  Xk, 

and  is  the  lower  bound  of  f(x)  on  the  same  segment.  By  the 
definition  of  the  Riemann  integral, 

J  =  lim  Sn  =  lim  Sn. 

n-*co  n-*-co 

Let  us  set 

fn(x)  =  Mnk  for  Xk-1  <  X  <  Xk, 
fn(x)  =  m„k  for  Xk- 1  <  x  <  xk. 

At  the  point  x  =  b  the  functions  /„  and  /„  can  be  defined  arbi¬ 
trarily.  It  is  easy  to  compute  that 

f  fn(x)  dfi  =  Sn, 

J  [a, b] 


Since  the  sequence  {/„}  does  not  increase,  and  the  sequence  {/„! 
does  not  decrease,  we  have  almost  everywhere 


fn(x)  ->  f(x)  >  f{x), 


ln(x)  -*  fix)  <  fix). 
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By  Theorem  2,  §12, 

/  f(x)  dn  =  lim  Bn  =  J  =  lim  Sn=  [{x)  dfi. 

•'[a, 6]  n-*oo  n-»oo 


Therefore 


f  |/0*0  -  f(x)  \dn  =  [  {/Or)  - 

and,  hence,  almost  everywhere 

f(x)  -  l(x)  =  0, 


f0*0  }  dn  =  0, 


i.e. 


f(x)  =  l(x)  =  /(x), 
f  f(x)  dfi  =  f  f(x)  dfi  =  J. 

J  [a  M  J  M 


The  theorem  is  proved. 

It  is  easy  to  give  examples  of  bounded  functions  which  are 
Lebesgue  integrable  but  which  are  not  Riemann  integrable  (for 
example,  the  Dirichlet  function,  mentioned  earlier,  which  is  equal 
to  unity  for  rational  x  and  equal  to  zero  for  irrational  x). 

In  general,  unbounded  functions  cannot  be  Riemann  integrable, 
however  many  of  them  are  Lebesgue  integrable.  In  particular, 
any  function  f(x)  for  which  the  Riemann  integral 


j  \f(x)\dx 


has  a  finite  limit  J  as  s  — >  0  is  Lebesgue  integrable  on  [0,  1]; 
moreover, 


/  f{x)  dfi  =  lim  /  f(x)  dx. 
•'[0,1]  '  £-*0 


In  connection  with  this  it  is  interesting  to  mention  that  the 
improper  integrals 


f  }{x)  dx  =  lim  f  f(x) 
Jo  1-+ 0  J  i 


dx 
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are  not  Lebesgue  integrable  in  the  case  when 

lim  /  \}{x)\dx  =  00  . 

£-*•0  "  £ 

Lebesgue  integration  has  always  an  absolute  character  in  the 
sense  of  Theorem  8,  §11. 


14.  Direct  Products  of  Systems  of  Sets  and  Measures 

An  important  role  in  analysis  is  played  by  theorems  which  re¬ 
duce  double  (or  generally  multiple)  integrals  to  iterated  inte¬ 
grals.  The  basic  result  in  the  theory  of  multiple  Lebesgue  integrals 
is  the  so-called  Fubini  theorem  which  will  be  proved  in  §16.  We 
shall  preliminarily  establish  some  useful  concepts  and  facts  which, 
by  the  way,  are  also  of  independent  interest. 

The  set  Z  of  ordered  pairs  ( x ,  y),  where  if  I  and  y  £  Y,  is 
called  a  direct  product  of  the  sets  X  and  Y  and  is  denoted  by 
Z  -  X  X  Y.  Analogously,  the  set  U  of  ordered  finite  sequences 
(Xi,  x2,  •  •  •,  x„),  where  xk  Xk,  is  called  the  direct  product  of 
the  sets  Xh  X2,  ■  •  Xn,  and  is  denoted  by 

Z  =  X 1  X  X2  X  •  •  •  X  x„  =  |x] xk. 

In  the  special  case  when 

X,  =  X2  =  •  •  •  =  Xn  =  X, 

the  set  Z  is  the  n-th  power  of  the  set  X : 

Z  =  Xn. 

For  example,  the  n-dimensional  coordinate  space  Dn  is  the  n-th 
power  of  the  real  axis  Dl.  The  unit  cube  Jn,  i.e.,  the  set  of  ele¬ 
ments  of  Dn  with  coordinates  which  satisfy  the  inequality 

0  <  xk  <  1,  k  =  1,2,  •  •  • ,  n, 
is  the  n-th  power  of  the  unit  segment  J1  =  [0,  1  ]. 
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If  ©i,  @2,  •  ■  -  ,  @„  are  systems  of  subsets  of  the  sets  Xh  X2, 
■  ■ ,  Xn,  then 

SS  =  6i  X  Si  X  ■  ■  ■  X  @„ 


denotes  the  system  of  subsets  of  the  set  X  =  |  X  |  Xk,  which  are 
given  in  the  form 

A  =  Ai  X  A2  X  X  A„, 


where  Ak  €  ©*• 

If  @1  =  @2  =  •  •  ■  =  ©„=©,  then  St  is  the  n-th  power  of  the 
system  @: 

9?  =  @». 

For  example,  the  system  of  parallelopipeds  in  T)n  is  the  n-th 
power  of  the  system  of  segments  in  D'. 

Theorem  1 .  If  ©i,  ©2,  •  ■  ■ ,  ©„  are  semirings,  then  fR  =  |  X  |  ©* 
is  also  a  semiring. 

Proof.  In  accordance  with  the  definition  of  a  semiring  (§2),  we 
have  to  show  that  if  A,  B  6  9t,  then  A  n  B  £  9?,  and  if,  moreover, 

m 

B  c  A,  then  A  =  U  C<,  where  C 1  =  B,  Ci  n  C ,■  =  0  for  i  ^  j 

i—1 

and  C,  £  9t  (i  =  1,2,  •  •  •,  m). 

Let  us  give  the  proof  for  the  case  n  =  2. 

I)  Let  A  6  @1  X  ©2,  0i  X  ©2;  this  means  that 


Then 

and  since 


A  —  A 1  X  A  2,  A 1  £  ©1,  A  2  (E  ©2, 

B  =  Bi  X  B2,  B\  6  ©1,  B2  (E  ©2- 

A  n  B  =  (di  n  Bi)  x  (d2  n  B2), 


Ai  n  Bi  (E  ©1,  A  2  n  b2  ie  @2, 
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we  have 

A  n  B  £  X  @2. 

II )  Let  now,  in  the  assumptions  of  paragraph  I ),  B  Q  A.  Then 
B1  c  Ai,  B2  C  At, 

and,  because  @1  and  ©2  are  semirings,  we  have  the  decompositions 
di  =  5,  u  u  •  •  u  BP, 

d2  =  B2  u  £2«>  u  •  •  •  U  £2(i>, 

,4  =  Ai  X  d2  =  (5,  X  Bt)  U  (Bi  X  B2(1>)  U  •  •  •  U (Bi  X  £2<!)) 

U  (BP  X  B2)  U  (B,o>  X  B2<«)  U  ...  U  (£,<»  X  BP) 


U  (BP  X  B2)  u  (BP  xft)  U  •  •  •  u  (Bi»  X  £2<!>). 


In  the  last  decomposition  the  first  term  is  B,  X  B2  =  B  and  all 
terms  belong  to  the  system  @i  X  ©2.  The  theorem  is  proved. 

However,  from  the  assumption  that  the  systems  <&k  are  rings 
or  Borel  rings,  it  does  not  yet  follow  that  the  product  |  X  |  ©*  is  a 
ring  or  a  Borel  ring,  respectively.  k 

Let  the  measures 

Mi(di),  /x2(d2),  •  •  • ,  fji„(An) ,  Ak  €  ©i, 
be  given  on  the  semirings  @i,  ©2,  •  •  •, 

Let  us  define  on 


the  measure 


SB  =  ®iX@jX  X  ©„ 


M  =  MI  X  M2  X  •  •  •  X  Mn 


by  the  condition:  if  A  =  Ai  X  A2  X  •  •  •  X  An,  then 


m(A)  =  mi(Ai) M2(A2) 
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We  still  have  to  show  that  n(A)  is  a  measure,  i.e.,  that  n(A)  is 
additive.  We  shall  do  this  for  the  case  n  =  2.  Assume  the  de¬ 
composition 

A  =  Ai  X  A2  =  U  B(k\  Bi0  n  B =  0  for  i  ^  j, 

=  Bitk)  x 

given.  As  was  shown  in  §2,  there  exist  decompositions 
Ar  =  U  A2  =  U  C2<»> 

m  n 

such  that  the  sets  B,(k)  are  unions  of  some  C i(m),  and  the  sets 
Bi(k)  are  unions  of  some  C2(m) .  Obviously, 

n(A)  =M1  (A,)M2(d2)  =  E  Emi(Ci‘"))w(C,<-»),  (1) 

m  n 

m(B(W)  =  MiCftWMA®)  =  E  Emi(C1<”»)m2(C2('*>);  (2) 

m  n 

moreover,  all  the  terms  which  appear  in  the  right-hand  side  of 
equation  (2)  appear  once  in  the  right-hand  side  of  equation  (1 ). 
Therefore, 

/x(d)  =  E  m(£W), 

it 

wtich  was  to  be  proved. 

Thus,  in  particular,  the  additivity  of  elementary  measures  in 
the  n-dimensional  Euclidean  space  follows  from  the  additivity  of 
the  linear  measure  on  a  line. 

Theorem  2.  If  the  measures  mu  M2,  •  •  •,  are  a-additive,  then  the 
measure  m  =  mi  X  M2  X  •••  X  is  also  tr-additive. 

Proof.  Let  us  prove  the  theorem  for  the  case  n  =  2.  Let  us  de- 

'  ao 

note  by  Xi  the  Lebesgue  space  of  measure  mi-  Let  C  =  U  C„,  where 
the  sets  C  and  C„  belong  to  X  ©2,  i.e.,  n_1 

C  =  A  X  B,  A  <E  ©!,  B  e  @2, 

Cn  =  An  X  Bn,  An  €  @1,  £  @2. 
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Let  us  set,  for  x  £  A^, 

f  M2  ( -fL )  if  x  c  An, 

/.(i)  = 

(o  if  I  $  An. 

It  is  easy  to  see  that  for  x  6  A, 

Zfn(x)  =  /I2  (B). 

n 

Therefore,  by  the  corollary  to  Theorem  2,  §12, 

E  f  /»(*)  dX,  =  [  M2 (B)  d»i(A )  =  m(C); 

n  •'A  •'A 

but 

f  fn(x)  d\  1  =  M2(f?n)Ml(A„)  =  n(Cn), 

JA 

and,  consequently 

Em(C„)  =  m(C). 

The  Lebesgue  continuation  of  the  measure  mi  X  mi  X  •  •  •  Xm» 
we  shall  call  the  product  of  the  measures  m*  and  denote  by 

Ml  ®  M2  <8>  •  •  •  <8>  Mn  =  I  <8>  I  Mfc- 

In  the  case 

Ml  =  M2  =  •  •  •  =  Mn  =  M, 

we  obtain  the  concept  of  a  power  of  the  measure  m  : 

n 

Mn  =  |  <S>  I  M*,  Mi  =  M' 
k=l 

For  example,  the  n-dimensional  Lebesgue  measure  Mn  is  the 
n-th  power  of  the  linear  Lebesgue  measure  m1- 

15.  Expressing  the  Plane  Measure  by  the  Integral  of  a  Linear 
Measure  and  the  Geometric  Definition  of  the  Lebesgue  Integral 

Let  a  domain  G  in  the  (x,  y  )-plane  be  bounded  by  the  vertical 
lines  x  =  a,  y  =  b  and  by  the  curves  y  =  <p(x),  y  =  ${x). 
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As  is  well  known,  the  area  of  the  domain  G  is  equal  to  the 
integral 

V(G)  =  f  { <p(x)  -  *(*)}  dx. 

" a 

Moreover,  the  difference  <p(x0)  -  <P(x0)  is  equal  to  the  length  of 
the  intersection  of  the  domain  G  with  the  vertical  line  x  =  x0.  Our 
task  is  to  carry  over  such  a  method  of  measuring  areas  to  arbi¬ 
trary  measure  products 

Here  we  shall  assume  that  the  measures  Hz  and  yv  are  defined 
on  Borel  rings,  are  v-additive  and  have  the  closure  property 
(if  B  C  A  and  y(A)  =0,  then  B  is  measurable)  which,  as  was 
shown  earlier,  is  possessed  by  all  Lebesgue  continuations. 

Let  us  introduce  the  notation: 

Az  =  { y:(x,  y)  e  A], 

Av  =  \x:(x,y)  e  A}. 

If  X  and  F  are  real  lines  (and  X  X  Y  a  plane),  then  AZo  is  the 
projection  of  an  intersection  of  the  set  A  and  a  vertical  line 
x  =  x0  onto  the  F-axis. 

Theorem  1.  Under  the  assumptions  enumerated  above,  for  any 
n-measurable  set  A*, 

M(A)  J  tXy(Ax)  dll  jr  J  [Ix(Ay')  dfly. 

Proof.  It  is  obvious  that  it  suffices  to  show  the  equality 


ti(A)  =  /  <pA(x)  dyx,  where  <pA(x)  =  tiy(Ax),  (1) 

J  x 

since  the  second  part  of  the  assertion  of  the  theorem  is  quite 
analogous  to  the  first.  Let  us  note  that  the  theorem  automatically 


*  Note  that  integration  over  X  is  in  fact  reduced  to  integration  over  the  set 


U  Ay  C  X,  outside  of  which  the  integrand  is  equal  to  zero.  Analogously, 

V 
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includes  the  statement  that  for  almost  all  x  (in  the  sense  of  the 
nx  measure)  the  sets  Ax  are  measurable  with  respect  to  the  meas¬ 
ure  ny,  and  that  the  function  <pA(x)  is  measurable  with  respect 
to  the  measure  nx.  Otherwise  formula  (1 )  would  not  make  sense. 

The  measure  /x  is  a  Lebesgue  continuation  of  the  measure 


m  —  /lx 


which  is  defined  on  the  system  Sm  pf  sets  of  the  form 

A  =  A  yQ  X  Axq- 

For  such  sets  equation  (1 )  is  obvious,  since  for  them 

f  ( A  x0)  for  x  (z  A  yoj 

<Pa(x)  =  | 

[0  for  i  $  4»t. 

It  is  easy  to  see  that  equation  (1 )  applies  also  to  sets  from 
9t()Sm),  which  decompose  into  a  finite  number  of  pairwise  non¬ 
intersecting  sets  from  Sm. 

The  proof  of  equation  (1 )  for  the  general  case  is  based  on  the 
following  lemma,  which  is  also  of  independent  interest  for  the 
theory  of  Lebesgue  continuations. 

Lemma.  For  any  n-measurable  set  A  there  exists  a  set  B  of  the  form 
B  =  0  Bn,  B,  3  B2  D  •  •  •  D  Bn  2  •  •  •, 

n 

Bn  =  U  Bfi/cy  Bnl  C  B„2  C  ...  QBnkQ  ■  • 

k 

where  the  sets  Bnk  belong  to  9?  (Sm ) ;  moreover,  A  c  B  and 

»(A)=»(B).  (2) 

The  proof  of  the  lemma  is  based  on  the  fact  that,  by  the  defini¬ 
tion  of  measurability,  for  any  n  the  set  A  can  be  included  in  the 
union  C„  =  u  A  nr  of  the  sets  Anr  of  Sm  in  such  a  way  that 

r 

n{C„)  <  ix{A)  +  -. 

n 
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Setting Bn  =  f!  it  is  easy  to  see  that  the  sets  B„  will  have 
the  form  Bn  =  U  5„s,  where  Sng  belongs  to  Sm,  Finally,  setting 

k  s 

Bnk  =  U  5na,  we  obtain  the  system  of  sets  Bnk  required  by  the 

s=l 

lemma. 

It  is  easy  to  carry  over  equation  (1 )  from  the  sets  B„k  €  (>Sm) 

to  the  sets  Bn  and  B  with  the  help  of  Theorem  2,  §12,  since 

VbSsB)  =  Ih11  ‘PBnk(X)>  VBnl  ^  PB-nt  < 

k-*<® 

pBix)  =  lim  <pBn(x)>  Vb\  ^  <Pbi  > 

n-»oo 

If  n(A)  =0,  then  n{B)  =0  and  almost  everywhere 
<Pb(x)  =  Hv(Bx)  =  0. 


Since  Az  c  Bx,  the  set  Az  is  measurable  for  almost  all  x,  and 

<Pa(x)  =  Hv(Az)  =  0, 

J  <Pa(x)  dfxz  =  0  =  n(A). 

Consequently,  for  the  sets  A  for  which  n(A  )  =0,  formula  (1 )  is 
true.  Let  us  represent  A  if  it  is  arbitrary  in  the  form  A  =  B\C, 
where,  by  (2), 

m(C)  =  0. 

Since  formula  (1 )  holds  for  the  sets  B  and  C,  it  is  easy  to  see  that 
it  also  holds  for  the  set  A  itself. 

The  proof  of  Theorem  1  is  complete. 

Let  us  now  consider  the  special  case  when  Y  is  the  real  axis, 
fiy  a  linear  Lebesgue  measure  and  the  set  A  the  set  of  points 
( x ,  y )  of  the  form 

x  €  M,  0  <  y  <  fix),  (3) 

where  M  is  some  ^-measurable  set  and  f(x)  an  integrable  non- 
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negative  function.  In  this  case 


and 


f  f(x)  for  x  d  M, 

Vv(Ax)  = 

(0  for  x  (£  M, 

mM)  =  f  f(x)  dnz. 


We  have  proved  the  following 

Theorem  2.  The  Lebesgue  integral  of  a  noru-negative  function  f(x) 
is  equal  to  the  measure  n  =  nz  X  nv  of  the  set  A,  defined  by  rela¬ 
tion  (3). 

In  the  case  when  X  is  also  the  real  axis,  the  set  M  is  a  segment, 
and  the  function  f(x)  is  Riemann  integrable,  this  theorem  re¬ 
duces  to  the  known  fact  that  the  integral  can  be  expressed  as  the 
area  under  the  graph  of  the  function. 


16.  Fubini's  Theorem 

Let  us  consider  a  triple  product  of  the  form 
U  =  X  X  Y  X  Z. 

We  shall  identify  the  point 

(x,  y,z)  €  U 

with  the  points 

((*>  y),  z), 

(x,  (: V ,  z) ) 

of  the  products 

(X  X  Y)  X  Z, 

X  X  (F  X  Z). 


(1) 


(2) 

(3) 


With  this  understanding  one  can  consider  the  products  (1), 
(2 )  and  (3 )  as  one  and  the  same  thing. 
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If  the  measures  hx,  Hy,  Hz  are  given  on  X,  Y,  Z,  then  the  measure 

Hu  =  fix  ®  Hv  ®  Hz 

can  be  defined  either  as 


Hu  =  (Hz  ®  Hv )  ®  Hz, 

or  as 

Hu  =  Hz  ®  ( Hv  ®  Hz)- 

The  rigorous  proof  for  the  equivalence  of  these  definitions  is 
omitted  here,  although  it  is  not  difficult. 

We  shall  give  the  application  of  these  general  constructions  to 
the  proof  of  the  basic  theorem  of  the  theory  of  multiple  integrals. 

Fubini's  Theorem.  Let  the  measures  Hz  and  h«  be  defined  on 
Borel  rings,  u-additive  and  complete ;  let,  moreover, 


H  =  Hz  ®  Hv, 


and  let  the  function  f(x,y)  be  integrable  with  respect  to  the  measure 
h  on  the  sets 


Then * 


A  -  AyQ  X  Az(t- 


Jj(x>  y)  dH  =  fx  /(*>  y )  dn)j  dtiz  =  (^f  f(x,  y)d>i}j  dny.  (4) 

Proof.  The  assertion  of  the  theorem  includes  the  existence  of 
the  inner  integrals  in  parentheses  for  almost  all  values  of  the 
variable,  with  respect  to  which  the  integration  in  the  parentheses 
is  taken. 

Let  us  first  give  the  proof  for  the  case  f(x,y)  >  0.  For  this 
purpose  let  us  consider  the  triple  product 

U  =  X  X  Y  X  D\ 

where  the  third  factor  is  the  rfeal  axis,  and  the  product  of  measures 

\  —  Hz  ®  Hv  ®  H1  =  H  ®  H1, 

where  h1  is  a  linear  Lebesgue  measure. 


*  See  footnote  on  page  83. 
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In  U  we  define  the  subset  W  by  the  condition 

(x,  y,  z)  6  W, 


if 


X  €  Ay,  ye  Ax, 

0  <  2  <  f{x,  y). 

By  Theorem  2  of  §15, 

A  (WO  =  f  fix,  y)  dp.  (5) 

J  A 

On  the  other  hand,  by  Theorem  1  of  §15, 

A(W0  =  f  ZiWz)  dpz,  (6) 

Jx 

where  {  =  /i,  X  |i‘  and  Wz  denotes  the  set  of  pairs  (y,  z )  for  which 
ix,  y,  z)  e  W.  Here,  by  Theorem  1  of  §14, 


S(W, 


-/ 


fix,  y)  dpu. 


Combining  (5),  (6)  and  (7)  we  obtain 


(7) 


f  fix,  y)  dp  =  j  (f  fix,  y)  dpvJ 

A  X  \  A*  / 


dpz 


which  was  to  be  proved. 

The  general  case  is  reduced  to  the  case  fix,  y)  >  0  with  the 
help  of  the  equations 


f+(x, 


fix,  y)  =  f  +  ix,  y)  —  f~ix,  y), 

\fix,y)  \  +  fix,  y)  r_,  s  \fix,y)\  -fix,y) 

- o - ,  /  (*,  V )  =  - - -T - 


Remark  1 .  One  can  show  that  if  the  function  fix,  y)  is  ^-meas¬ 
urable  the  integral 


/ 


fix,  y)  dp 
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exists,  provided 


exists. 


\{j  1-^’  dfiz 


Examples  showing  when  equation  (4)  does  not  hold: 
1 )  Let 

A  =  [-1,  lj 


f(x,y)  = 


(x2  +  y2Y 


for  y  0,  and 


for  i^O.  Therefore 


f  f(x,  y)  dx  =  0 
*/— 1 

f  fix,  y)  dy  =  0 


f  i  (/  ^X‘  dx)  dy  =  /  (/  ^  ^  rf2/)  dx  = 

however,  the  integral  does  not  exist  in  the  sense  of  a  Lebesgue 
integral  over  the  square,  since 

f  f  | lf(*,«)\dzdy  >  fdr  r™*™l!dr  -if*-.. 

i  i  •'o  /■  •'o  r 


2)  A  =  [0,  l]2, 


fix,  y ) 


22n  for  —  <  x  <  - ,  —  <  y  <  - 

2»  2n_1  2n  **  2n_1 

-  22n+1  for  —  <  z  <  — ,  —  <  y  <  — , 

Qn+1  2n?  2n  ”  9n — 1* 


0  for  all  other  cases. 
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One  can  compute  that 


f(f  fix,  y )  dxj  dy 


0, 


17.  The  Integral  as  a  Set  Function 

Let  us  consider  the  integral  F(  A )  =  /  f(x)dx  as  a  set  function, 

J  A 

assuming  that  Sh  is  a  Borel  algebra  with  unit  X  and  that 

/  f(x)dx  exists.  Then,  as  was  shown  earlier: 

J  x 

1.  F(  A )  is  defined  on  a  Borel  algebra 

2.  F{A)  is  real; 

3.  F{A)  is  additive,  i.e.,  for  every  decomposition 

A  =  [}  An 

n 

of  the  set  A  £  S?  into  the  sets  An  £  S„ 

F(A)  =  EF(d„); 


4.  F(A)  is  absolutely  continuous,  i.e.,  y{A)  =0  implies 
F(A)  =  0. 

Radon's  Theorem.  If  the  function  F(A)  possesses  the  proper¬ 
ties  1,  2,  3  and  4,  then  it  may  be  represented  in  the  form 

F(A)  =  f  f(x)  dy. 

J  A 

We  shall  show  that  the  function/  =  dF /dy  can  be  uniquely 
determined  up  to  values  on  a  set  of  measure  zero.  Indeed,  if  for  all 
A  (z  S  M 


F(A) 


/  fi(x)  dy  =  /  f2( x)  dy, 

•>  A  "  A 
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then  for  any  n  for  the  sets 
A„  = 

we  have 

n(An)  <  n 

Analogously,  for 

Bn  = 

we  have 

fj.  (Bn)  =  0. 

Since 

\x:fi  ^  h\  =  U  An  U  U  Bm, 

n  m 

n{x:fi  7*  f2\  =  0. 

Our  assertion  is  proved. 


x-.fi  —  fl  > 


n 


(  (h~h)dn=  0. 

*  A  _ 


x:fi  -  fi  > 
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FUNCTIONS  WHICH  ARE  SQUARE 
INTEGRABLE 


One  of  the  most  important  spaces  among  the  various  linear 
normed  spaces  which  are  encountered  in  functional  analysis  is  the 
Hilbert  Space.  Its  name  is  due  to  the  German  mathematician 
D.  Hilbert  who  introduced  this  space  in  connection  with  investi¬ 
gations  in  the  theory  of  integral  equations.  It  is  a  natural  infinite 
analogue  of  the  n-dimensional  Euclidean  space.  We  have  already 
made  acquaintance  with  one  of  the  possible  generalisations  of 
Hilbert  space  in  Chapter  III  of  Volume  I,  it  is  the  space  l2  whose 
elements  are  sequences  of  numbers 

X  =  ( Xl ,  X2,  Xn,  •••), 

which  satisfy  the  relation 

n= 1 

The  concept  of  the  Lebesgue  integral  allows  us  to  introduce 
another,  in  some  cases  more  convenient,  realisation  of  the  same 
space — the  space  of  square  integrable  functions.  In  this  chapter 
we  shall  consider  the  definition  and  the  basic  properties  of  the 
space  of  square  integrable  functions  and  establish  that  it  is  iso¬ 
metric  (with  corresponding  assumptions  about  the  measure  with 
respect  to  which  the  integration  is  performed)  to  the  space  l2. 

In  the  next  chapter  we  shall  give  an  axiomatic  definition  of 
Hilbert  space. 

1 8.  The  Lj  Space 

Below  we  shall  consider  functions  f(x),  defined  on  some  set  R 
for  which  a  measure  n(E)  is  given  satisfying  the  condition 
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ft( R )  <  oo.  The  functions  f(x)  are  assumed  to  be  measurable  and 
defined  almost  everywhere  on  R.  We  shall  not  distinguish  be¬ 
tween  functions  which  are  equivalent  on  R.  Instead  of  f  we 
shall,  for  the  sake  of  conciseness,  write 

Definition  1.  One  says  that  the  function /(t)  is  square  integrable 
(or  summable )  over  R  if  the  integral 

J/2(x)  dn 

exists  (i.e.,  is  finite).  The  set  of  all  functions  which  are  square 
integrable  over  R  we  shall  denote  by  L2. 

We  shall  now  establish  the  basic  properties  of  such  functions. 

Theorem  1 .  The  product  of  two  square  integrable  functions  is  an 
integrable  function. 

The  proof  follows  immediately  from  the  inequality 

!/<*«*)  I 

and  the  properties  of  the  Lebesgue  integral. 

Corollary.  Every  square  integrable  function  f(x)  is  integrable. 
Indeed,  it  suffices  to  set  g(x)  =  1  in  Theorem  1. 

Theorem  2.  The  sum  of  two  L2  functions  is  also  in  L2. 

Proof.  Indeed, 

(/(z)  +  g{x)Y  < /20*0  +  2|/(x)gr(a:)  |  +  g2(x), 

and,  by  Theorem  1,  each  of  the  three  functions  on  the  right  is 
summable. 

Theorem  3.  If  fix)  £  L2  and  a  is  an  arbitrary  number,  then 
af(x)  £  L2. 
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Proof.  If  /  £  L2,  then 

J  (af(x))2  dii  =  a2  J  f2(x)  dn  <  ». 

Theorems  2  and  3  show  that  linear  combinations  of  L2  func¬ 
tions  again  belong  to  L2;  moreover,  it  is  clear  that  sums  of  L2 
functions  and  their  products  with  numbers  satisfy  all  the  condi¬ 
tions  1-8,  enumerated  in  the  definition  of  a  linear  space  (Chapter 
III,  §24  of  Volume  I),  in  other  words,  the  class  L2  of  square 
integrable  functions  is  a  linear  space. 

Let  us  now  define  the  scalar  product  of  L2  functions  by  setting 

U,g)  =  f  f(x)g(x)  d/i.  (1) 

JR 

As  is  well  known,  one  understands  by  a  scalar  product  any  real 
function  of  a  pair  of  vectors  of  the  linear  space  which  satisfies  the 
following  conditions : 

1)  (f,g)=(g,f), 

2)  ( fi  +  fa,  g )  =  (fa,g)  +  ( fa,  g) , 

3)  W,g)=Hf,g), 

4)  (/,/)  >o  if  f*  o. 

From  the  basic  properties  of  the  integral  it  immediately  follows 
that  expression  (1 )  does  indeed  satisfy  conditions  1-3.  Moreover, 
since  we  have  agreed  not  to  distinguish  between  functions  which 
are  equivalent  (and  in  particular,  to  take  as  a  unit  element  the 
set  of  all  functions  in  R  which  are  equivalent  to  f(x)  =  0),  con¬ 
dition  4  is  also  satisfied  (see  the  corollary  to  Theorem  9,  §11). 
This  leads  us  to  the  following  definition. 

Definition  2.  By  L2  space  one  understands  a  Euclidean  space,* 
the  elements  of  which  are  classes  of  equivalent  square  integrable 
functions;  addition  of  elements  and  their  multiplication  by  num¬ 
bers  are  defined  as  the  usual  addition  and  multiplication  of 
functions,  and  the  scalar  product  by  the  formula 

(.fag)  =  f  }(x)g(x)  dii.  (1) 


*  A  Euclidean  space  is  a  linear  space  in  which  a  scalar  product  has  been  introduced. 
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In  Li,  as  in  every  Euclidean  space,  the  Cauchy-Bunjakovskii* 
inequality,  having  in  this  case  the  form 

{j  f(x)g(x)  dfij  <  J  }2(x)  dfi-  J  g2(x)  dfi,  (2) 

and  the  triangle  inequality,  having  the  form 

yj  j  (fix)  +  g(x)Y  dfi  <  -J  J  f2(x)  dfi  +  yjj  g2(x)  dn,  (3) 
are  satisfied. 

In  particular,  for  g(x)  =  1,  the  Cauchy-Bunjakovskii  inequality 
implies  the  following  useful  inequality : 

(/ fix)  dfij  <  n(R)  f  f2(x)  dn.  (4) 

Let  us  introduce  a  norm  in  L2,  setting 

11/ II  =  VUJ)  =  ■sjfpix)  ft  Li.  (5) 

Exercise.  Assuming  properties  1-4  of  a  scalar  product,  show 
that  the  norm,  defined  by  equation  (5),  satisfies  conditions  1-3 
of , the  definition  of  the  norm  (§21  of  Volume  I ). 

The  following  theorem  plays  a  very  important  role  in  many 
problems  of  analysis : 

Theorem  4.  The  L2  space  is  complete. 

Proof,  a)  Let  {fn(x) )  be  a  fundamental  sequence  in  L2)  i.e.,  let 
Wfn  -/m||— >0  for  n,  m— »  ». 

Then  we  can  select  a  subsequence  of  indices  [nk\  in  such  a  way 
that 

I  I  f  nk  fnk+1  I  I  ^ 


Translator’s  Note:  This  is  usually  known  as  the  “Schwarz”  inequality. 
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This  implies,  by  inequality  (4 ),  that 

f  |  fnk(x)  -  fnk+1(x)  \dfi  <  [>(tf)]1/2jy  (fnk(x)  -  /„l+1  (x)  ) 2  d/lj 

<  jkMR)J12- 

This  inequality  and  Theorem  2,  §12  (Corollary)  yield  the  result 
that  the  series 

|/<u(z)|  +  I  fnt(x)  -  fm(x)  I  +  ■•■ 
converges  almost  everywhere  on  R.  Then  also  the  series 
/»lO)  T  f n%(x)  fn,ix))  +  •  •  • 

converges  almost  everywhere  on  R  to  some  function 

fix)  =  lim /„*(£).  (6) 

k~*  oo 

Thus  we  have  shown  that  if  the  sequence  {/„(  x )  (  is  fundamental 
in  L2,  then  one  can  always  find  a  subsequence  which  converges 
almost  everywhere. 

b)  Let  us  now  show  that  the  function  f(x)  given  by  equation 
(6)  belongs  to  L2  and  that 

||/„(z)  — /(x)||->0  for  n— >  «>.  (7) 

For  any  sufficiently  large  k  and  l  we  have 

j ( fnk(x )  -fm(x))2dn  <  e. 

Accordingly,  by  Theorem  3,  §12,  we  can  proceed  to  the  limit 
under  the  integral  sign  for  l  — *  «  in  this  inequality.  We  obtain, 

J  ( fnk{x )  -  f{x))2  dn  <  s, 

which  yields  /  6  L2  and  f„k  —*f.  But  from  the  fact  that  the  funda¬ 
mental  sequence  contains  a  subsequence,  converging  to  some 
limit,  it  follows  that  it  converges  to  the  same  limit.*  The  theorem 
is  proved. 

*  We  understand  here  by  convergence  the  fact  that  equation  (7)  is  satisfied; 
concerning  this  see  the  beginning  of  §19. 
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19.  Mean  Convergence.  Sets  in  Lt  which  are 
Everywhere  Complete. 


Introducing  a  norm  in  L2  we  have  also  introduced  a  certain  kind 
of  new  convergence  for  the  square  integrable  functions,  namely: 


means  that 


/„-»/  (in  Lf) 
lim  f  —  f(x)J  dfi  =  0. 


Such  a  convergence  of  functions  is  called  convergence  in  the 
mean  or,  more  exactly,  convergence  in  the  mean  square. 

Let  us  find  the  connection  between  the  concept  of  convergence 
in  the  mean  and  uniform  convergence,  as  well  as  the  concept  of 
convergence  almost  everywhere  which  we  introduced  in  Chapter 
II. 

Theorem  1.  If  the  sequence  {/„(£))  of  functions  in  L2  converges 
uniformly  to  fix),  thenf(x)  d  L2  and  { fn(x)\  converges  to  fix)  in 
the  mean. 

Proof.  Let  s  >  0.  If  n  is  sufficiently  large,  then 


| fn(x)  -  f{x )  |  <  e, 

whence 


J  if  nix)  -  fix))2  dfi  <  e?niR). 

This  inequality  immediately  yields  the  assertion  of  the  theorem. 

From  Theorem  1  it  follows  that  if  one  can  approximate  any 
function  /  6  i2  to  an  arbitrary  degree  of  accuracy  by  functions 
fn  €  M  Q  Li  in  the  sense  of  uniform  convergence,  then  one  can 
use  them  to  approximate  any  function  from  L2  also  in  the  sense 
of  convergence  in  the  mean. 

Hence  one  can  approximate  any  function/  £  L2  to  any  degree 
of  accuracy  by  simple  functions  belonging  to  L2. 
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We  shall  show  that  any  simple  function/  £  L2,  and  therefore, 
also  in  general  any  function  from  L2,  can  be  approximated  as 
closely  as  desired  by  simple  functions  which  take  on  only  a  finite 
number  of  values. 

Let  f{x)  take  on  the  values  y i,  — ,  y„,  ■  •  •  on  the  sets  Eh  •  •  •, 
En,  ■  •  • .  Since  /2  is  integrable,  the  series 

E  Vr?n{En)  =  ff(x)  dn 

n  J 

converges.  Let  us  select  a  number  N  in  such  a  way  that 


and  set 


tN(x) 


Then  we  have 


E  Vntl{En)  <  £, 

n>N 

fix )  for  x  €  E{,  i  <  N, 
0  for  x  £  Ei,  i  >  N. 


/  (/(z)  -  fN(x)y  dfi  =  E  yn2n(En)  <  s, 

i.e.,  the  functions  fN,  which  take  on  a  finite  number  of  values, 
approximate  the  function  /  as  closely  as  required. 

Let  R  be  a  metric  space  having  a  measure  which  satisfies  the 
following  condition  (satisfied  in  all  cases  of  practical  interest ) : 
all  open  and  all  closed  sets  in  R  are  measurable  and  for  any 
M  C  R, 

tx*(M)  =  inf  n(G),  (*) 

M9G 

where  the  lower  bound  is  taken  over  all  open  sets  G  containing 
M.  Then  the  following  theorem  holds. 

Theorem  2.  The  set  of  all  continuous  functions  is  complete  in  L2. 

Proof.  By  what  has  been  said  above  it  suffices  to  show  that 
every  simple  function  which  takes  on  a  finite  number  of  values 
is  a  limit,  in  the  sense  of  mean  convergence,  of  continuous  func¬ 
tions.  Furthermore,  since  every  simple  function  which  takes  on  a 
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finite  number  of  values  is  a  linear  combination  of  characteristic 
functions  xm(x  ')  of  measurable  sets,  it  suffices  to  give  the  proof 
for  these  latter  ones.  Let  M  be  a  measurable  set  in  the  metric 
space  R.  Then,  from  condition  ( *  )  it  immediately  follows  that 
for  any  e  >  0  one  can  find  a  closed  set  FM  and  an  open  set  Gm 
for  which 

F m  d  M  d  Gm  and  h(Gm)  —  p(F  m)  <  s. 

Let  us  now  define  the  function  <pc(x)  by  setting 

k,  ,  _  _ p(x,  R\Gm) _ 

p(x,  R\Gm)  +  p(x,  F m) 

This  function  equals  0  for  x  C  R\Gm  and  equals  1  for  x  £  F M- 
It  is  continuous  since  each  of  the  functions  p(x,  F M)  and 
p(x,  R\Gm)  is  continuous  and  since  their  sum  is  equal  to  zero. 
The  function  xm(x)  -  <pt(x)  is  not  greater  that  unity  on  GM\F 
and  equals  zero  outside  this  set.  Therefore 

J  (xm(x)  —  <pt(x))2  dp  <  £, 

which  yields  the  assertion  of  the  theorem. 

Theorem  3.  If  the  sequence  {/„(x)|  converges  to  f(x)  in  the  mean, 
then  one  can  select  from  it  a  subsequence  {f„k(x) )  which  converges  to 
f{x)  almost  everywhere. 

Proof.  If  the  sequence  j/„(x)}  converges  in  the  mean,  then  it  is 
fundamental  in  L2;  hence,  repeating  the  arguments  of  part  a)  of  the 
proof  of  Theorem  4,  §18,  we  see  that  one  can  select  from  {/„(x) }  a 
subsequence  {/„t(x))  which  converges  almost  everywhere  to 
some  function  <p(x).  Furthermore,  the  arguments  of  part  b)  of 
the  same  proof  show  that  {/n(t(x)|  converges  to  <p(x)  also  in  the 
mean,  which  yields  ^(x)  =  /(x)  almost  everywhere. 

It  is  not  difficult  to  convince  oneself  by  examples  that  the  convergence 
in  the  mean  of  some  sequence  does  not  imply  that  this  sequence  itself 
converges  almost  everywhere.  Indeed,  the  sequence  of  functions  f„k 
constructed  on  page  57  obviously  converges  in  the  mean  to  the  function 
f  =  0  but  still,  as  shown  there,  does  not  converge  everywhere.  We  shall 
now  show  that  convergence  almost  everywhere  (and  even  everywhere) 
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does  not  imply  convergence  in  the  mean.  Let 

n  for  x  (ifo,  , 

fn(x)  =  V  n/ 

0  for  all  other  values  of  x. 

It  is  clear  that  the  sequence  {/„(£)}  converges  to  0  everywhere  on 
[0,  l],  however  at  the  same  time 


/  fn(x)  dx  =  n  — »  oo. 

•'o 

Tchebishev’s  inequality  (§11,  Theorem  9)  implies  that  if  the  sequence 
converges  in  the  mean,  it  converges  in  measure.  Therefore,  Theorem  3 
which  we  proved  here  independently  follows  from  Theorem  4,  §9. 
The  relations  between  the  different  types  of  convergence  of  functions 
can  be  represented  by  the  following  scheme: 


Uniform  convergence 


I  .  .  .  I 


where  the  dotted  arrow  means  that  one  can  select  from  the  sequence 
which  converges  in  measure  a  subsequence  which  converges  almost 
everywhere. 


20.  Li  Spaces  with  a  Countable  Basis 

Generally  speaking,  the  L2  space  of  functions  which  are  square 
integrable  depends  on  the  choice  of  the  space  R  and  the  measure  /i. 
Its  full  notation  should  be  L2(R,  m)-  Only  in  exceptional  cases  is 
the  space  L2{R,  m)  finite  dimensional.  Of  more  importance  in 
analysis  are  those  spaces  L2(R,  n)  which,  in  a  certain  sense  to  be 
defined  later,  are  of  countable  dimension. 
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In  order  to  characterise  these  spaces  we  shall  need  yet  another 
concept  of  measure  theory. 

In  the  set  2ft  of  measurable  subsets  of  the  space  R  (we  assume 
its  measure  to  be  finite )  we  can  introduce  distance  by  setting 

p{A,  B)  =  p{A  A  B). 

If  we  identify  those  sets  A  and  B  for  which  p  (A  A  B )  =  0 
(i.e.,  in  other  words  we  consider  not  single  sets  but  classes  of  sets 
which  coincide  up  to  a  set  of  measure  zero ) ,  then  the  set  2ft  with 
the  distance  function  p  (A ,  B)  satisfies  all  the  axioms  of  a  metric 
space. 

Definition.  One  says  that  the  measure  p  has  countable  basis  if 
the  metric  space  2ft  contains  an  everywhere  countable  dense  set. 

In  other  words,  the  measure  p  has  a  countable  basis  if  there 
exists  a  countable  system 

A  =  {An},  n  =  1,  2,  •••  , 

of  measurable  subsets  of  the  space  R  (countable  basis  of  the 
measure  p)  such  that,  for  any  measurable  M  Q  R  and  any 
s  >  0,  one  can  find  an  Ak  £  A  for  which 

p(M  A  Ak)  <  e. 

In  particular,  the  measure  p  obviously  has  a  countable  basis  if  it 
can  be  represented  as  a  continuation  of  a  measure,  defined  on  a 
countable  system  of  sets  Sm.  Indeed,  in  this  case  the  ring  3t(Sm) 
(it  is  obviously  countable)  is,  by  Theorem  3,  §6,  the  required 
basis. 

In  particular,  the  Lebesgue  measure  on  a  segment  of  the  real 
axis  is  generated  by  a  system  of  intervals,  with  rational  end  points, 
as  elementary  sets.  Since  the  set  of  such  intervals  is  countable, 
the  Lebesgue  measure  has  a  countable  basis. 

The  product  p  =  /o  X  pi  of  two  measures  with  countable  basis 
also  has  a  countable  basis,  since  the  finite  sums  of  pairwise  prod¬ 
ucts  of  elements  of  the  basis  of  the  measure  pi  with  elements  of 
the  basis  of  the  measure  p2  form,  as  is  easily  checked,  the  basis  of 
the  measure  p  =  pi  X  pi.  Therefore  the  Lebesgue  measure  of  a 
plane  (and  also  an  ^dimensional  space)  has  a  countable  basis. 
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Let 

4!*,  A2*,  4,*,  ...  (1) 

be  the  countable  basis  of  the  measure  y.  It  is  easy  to  see,  that, 
enlarging  the  system  of  sets  (1),  one  can  form  a  countable  basis 
for  the  measure  y\ 

A  i,  An,  (2) 

which  satisfies  the  following  conditions: 

1 )  the  system  of  sets  (2 )  is  closed  with  respect  to  subtraction ; 

2)  the  system  of  sets  (2)  contains  R. 

From  conditions  1)  and  2)  it  follows  that  the  system  (2)  is 
closed  with  respect  to  a  finite  number  of  unions  and  intersections 
of  sets. 

This  follows  from  the  obvious  equalities 

A i  n  A2  =  Ai\(Ai\A2), 

AiUA2  =  R\{  (R\Ai)  n  (R\A»)). 

Theorem.  If  the  measure  y  has  a  countable  basis,  then  there  exists 
in  L2(R,  h)  an  everywhere  countable  dense  set  of  functions 

fh 

Proof.  As  such  a  basis  in  L2  ( R ,  /j)  we  can  select  the  finite  sums 

12  Ckfkix),  (3) 

A;=l 

where  ch  are  rational  numbers  and  fk(x)  are  the  characteristic 
functions  of  the  elements  of  the  countable  basis  of  the  measure  y. 

Indeed,  as  we  have  already  shown  in  the  preceding  section,  the 
set  of  step  functions  which  take  on  only  a  finite  number  of  differ¬ 
ent  values  is  everywhere  dense  in  L2.  Since  it  is  clear  that  any 
function  of  this  set  can  be  approximated  as  closely  as  required 
by  a  function  of  the  same  kind  but  taking  on  only  rational 
values,  and  since  the  set  of  functions  of  the  form  (3 )  is  countable, 
it  suffices,  in  order  to  prove  our  assertion,  to  show  that  any  step 
function  taking  on  the  values 


yi,  2/2,  •  •  • ,  yn,  •  •  •  (all  yi  are  rational) , 
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on  the  sets 

Eh  E2,  ,  U  Ei  =  R,  Ei  n  Ej  =  0  for  iV  j, 


can  be  approximated  as  closely  as  desired  in  the  sense  of  the  L2 
metric  by  a  function  of  the  form  (3).  By  this  remark,  one  can, 
without  loss  of  generality,  assume  that  our  basis  of  the  measure 
n  satisfies  conditions  1 )  and  2). 

By  the  definition  of  a  countable  basis  of  the  measure  p,  there 
exist,  for  any  s  >  0,  sets  Ai,  A2,  •••,  An,  •••  from  our  basis 
of  measure  p,  such  that  p{Ek,  Ak)  <  s,  i.e., 


yl(Ek\Ak)  u  (Ak\Ek)3  <  £• 


Let  us  set 


and  define 


d*'=d*\Udi,  A;  =  1,  2,  •••,  n, 

i<k 


f*{x )  = 


yk  for  x  €  Ak', 


0  for  x  €  ft\U  A  i. 

1=1 


It  is  easy  to  see  that  for  sufficiently  small  e  the  quantity 
p{x:f(x)  0  f* Or)} 

is  as  small  as  required  and,  therefore,  the  integral 

f  U(x)  -f*(x))2dp  <  (2  max  \yn\)2p{x:f(x)  0f*(x)\ 

is  arbitrarily  small  for  sufficiently  small  s. 

By  virtue  of  our  assumption  concerning  the  basis  of  the  meas¬ 
ure  p,  the  function  f*{x)  is  a  function  of  the  form  (3).  The 
theorem  is  proved. 

For  the  special  case  when  R  is  a  segment  of  the  real  line  and  p 
a  Lebesgue  measure,  the  countable  basis  in  L2(R)  can  also  be 
obtained  by  a  more  classical  method:  as  such  a  basis  we  could 
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take,  for  example,  the  set  of  all  polynomials  with  rational  co¬ 
efficients.  It  is  everywhere  dense  (even  in  the  sense  of  uniform 
convergence)  in  the  set  of  continuous  functions,  and  these  latter 
ones  form  an  everywhere  dense  set  in  L2(R,  n)- 

In  the  following  we  shall  limit  ourselves  to  the  consideration  of 
spaces  L2(R,  h)  which  have  a  countable  everywhere  dense  set 
(in  other  words,  are  separable — see  §9  of  Volume  I ). 

21 .  Orthogonal  Systems  of  Functions. 
Orthogonalisation. 

In  this  section  we  shall  investigate  functions  /  €  L2  which  are 
given  on  some  measurable  set  R  with  measure  n ;  we  shall  assume 
that  the  measure  has  a  countable  basis  and  satisfies  the  condition 
m(R)  <  <*>  ■  As  before,  we  shall  not  distinguish  between  equivalent 
functions. 

Definition  1 .  The  system  of  functions 

<pl  (x)  ,  <p2(x)  ,  '  •  •  ,  Vn(x)  (1) 

is  called  linearly  dependent,  if  there  exist  constants  ch  c2,  •  •  • ,  c„, 
not  all  equal  to  zero,  such  that 

ci<pi(x)  4 -  Ciwix)  4-  •”  +  c„<p„(z)  =  0  (2) 

almost  everywhere  on  R.  If,  however,  (2)  implies  that 

Cl  =  c2  =  •  •  •  =  cn  =  0,  (3) 

then  the  system  (1 )  is  called  linearly  independent. 

It  is  clear  that  a  linearly  independent  system  cannot  contain  a 
function  which  is  equivalent  to  'kfx)  =  0. 

Definition  2.  The  infinite  system  of  functions 

<P1  (x),  <p2(x),  •••,  <p„(x),  •••  (4) 

is  called  linearly  independent,  if  any  finite  part  of  it  is  linearly 
independent. 


21.  orthogonal  systems  of  functions 
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Let  us  denote  by 

M  =  M(<pi,  <P2,  <Pn,  •■•)  =  M{<pk) 

the  set  of  all  finite  linear  combinations  of  functions  of  the  system 
(4).  This  set  is  called  the  linear  closure  of  the  system  (4).  By 

M  =  <p2,  *•-,  <P„,  •••)  =  M{lpk) 

we  shall  denote  the  closure  of  the  set  M  in  the  space  L2.  M  is 
called  the  closed  linear  closure  of  the  system  (4). 

It  is  easy  to  see  that  the  set  M  consists  of  those  and  only  those 
functions  /  G  L2  which  can  be  approximated  by  finite  linear 
combinations  of  functions  of  the  system  (4)  with  a  prescribed 
accuracy. 

Definition  3.  The  system  of  functions  (4 )  is  called  complete,  if 
for  it, 

M  =  L2. 

Let  there  exist  in  the  space  L2  a  countable,  everywhere  dense 
set  of  functions 

f ij  ***>/”>  *  *  *  • 

Discarding  from  this  system  those  functions  which  are  linearly 
dependent  on  the  preceding  ones,  we  arrive  at  a  linearly  inde¬ 
pendent  system  of  functions 

Ql,  02,  gn,  ••• 

which,  as  is  easily  seen,  is  complete. 

In  the  case  that  there  exists  in  L2  a  finite  system  (1 )  of  linearly 
independent  functions, 

L2  =  M(tph  <p2,  •  •  -,  <Pn)  =  <p2,  •  •  •,  <P„ ) 

is  an  n-dimensional  Euclidean  space. 

In  all  cases  which  are  interesting  for  analysis  the  space  L2  is 
infinite  dimensional. 

It  is  obvious  that  the  system  (4)  is  complete  if  one  can  ap¬ 
proximate  each  of  the  functions,  belonging  to  some  set  which  is 
everywhere  complete  in  L2,  by  linear  combinations  of  functions 
belonging  to  the  system  (4 )  with  any  desired  degree  of  accuracy. 
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Let  R  =  [a,  6]  be  a  segment  of  the  real  axis  with  the  usual 
Lebesgue  measure.  Then  the  system 

1,  X,  X2,  •••,!",  •••  (5) 

is  complete  in  the  space  L2(R,  p). 

Indeed,  by  Weierstrass’  theorem,  the  linear  combinations  of 
functions  (5)  are  complete  in  the  set  of  all  continuous  functions. 
The  fact  that  the  system  (5)  is  complete  now  follows  from  our 
remark  and  Theorem  2,  §19. 

The  functions /(t)  and  g(x)  are  called  mutually  orthogonal,  if 
(f,g)  =  j  f(x)g(x)  dn  =  o. 

Every  system  <pi,  <p2,  •  •  •,  <pn,  •  •  -  of  functions  from  L2  which  are 
different  from  zero  and  are  pairwise  orthogonal  we  shall  call  an 
orthogonal  system.  An  orthogonal  system  is  called  normalised  if 
1 1  <fin  1 1  =  1  for  all  n ;  in  other  words, 


¥>1,  <P2,  <pn,  ••  • 

is  an  orthonormal  system  of  functions  if 

,  fO  for  t  ^  k, 

( <Pi ,  <Pk)  =  /  <pi{x)<pk{x)  dn  =  j 

(1  for  i  =  k. 

Examples.  1.  A  classical  example  of  an  orthonormal  system  of 
functions  on  the  segment  [  —  tt,  ■k~\  is  the  trigometric  system: 

1  cos  x  cos  2x  sin  x  sin  2x 

\/2t’  -\/w  ’  \^k’  ' 


2.  The  polynomials 


Pn(x) 


!  d"[(a:2  —  1)"] 


2"n! 


dx" 


n  =  0,  1,  2, 


which  are  called  Legendre  polynomials  form  an  orthogonal  sys¬ 
tem  of  functions  on  the  segment  [  —  1,  1].  The  functions 


4 


2n  +  1 


Pn(x) 


form  an  orthonormal  system. 
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It  is  easy  to  see  that  an  orthogonal  system  of  functions  is  linearly 
independent.  Indeed,  multiplying  the  relation 

cm  +  cm  +  •  •  •  4-  Cnpn  =  0 

by  <pi  and  integrating,  we  obtain 

Ci(<Pi,  <Pi)  =  0, 

and  since  ( <pi ,  <p{)  >0,  we  get  c<  =  0. 

Let  us  further  note  that,  if  in  the  space  L2  there  exists  an  every¬ 
where  countable  dense  set  fi,  /2,  •••,/»,  •  •  •,  then  any  orthonormal 
system  of  functions  {<pa\  is  at  most  countable. 

Indeed,  let  a  ^  ft,  then 

li  <Pa  —  <P»\\  =  \/2. 

For  each  a  let  us  select  from  our  everywhere  dense  set  an  fa  in 
such  a  way  that 

ik-Mi  <^=. 

It  is  clear  that  /«  ^  /3  if  a  ^  /3,  and,  since  the  set  of  all  the  /„  is 
countable,  the  set  of  the  <pa  themselves  is  also  not  more  than 
countable. 

In  studying  finite-dimensional  spaces  an  important  role  is 
played  by  the  concept  of  the  orthogonal  normalised  basis,  i.e., 
the  orthogonal  system  of  unit  vectors,  the  linear  closure  of  which 
coincides  with  the  whole  space.  In  the  infinite  case  the  analogue 
of  such  a  basis  is  the  complete  orthonormal  system  of  functions, 
i.e.,  a  system 


‘P 1,  <P2,  <pn,  *  •  •, 

such  that 

1)  (Vi,  Vk)  =  5 ik, 

2)  M  (951,  9^2,  <Pn,  '•  ’)  =  L2. 

Above  we  have  given  examples  of  complete  orthonormal  sys¬ 
tems  of  functions  on  the  segments  [  —  ir,  71-]  and  [  —  1,  1].  The 
existence  of  a  complete  orthonormal  system  of  functions  in  any 
separable  space  L2  follows  from  the  following  theorem: 
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Theorem.  Let  the  system  of  functions 

fuh>  •••,/»,  (6) 

be  linearly  independent.  Then  there  exists  a  system  of  functions 

<pi,  <te,  <Pn,  •  •  •  (7) 

satisfying  the  following  conditions : 

1 )  the  system  (7 )  is  orthonormal ;  * 

2)  every  function  <pn  is  a  linear  combination  of  functions 
fi,  f  2j  •••,/»: 

<Pn  =  Uni  f \  +  anz  +  •  •  •  +  annfn, 

where  a„„  ^  0; 

3)  every  function  /„  is  a  linear  combination  of  functions 

<Pl,  <pl,  •  '  <Pn- 

fn  —  b„itpi  +  b„2ip  +  •  •  •  +  bnn<Pn, 

where  b„„  ^  0. 

Every  function  of  the  system  (2)  is  uniquely  determined  {up  to 
the  sign)  by  conditions  l)-3). 

Proof.  The  function  ^i(t)  is  uniquely  determined  (up  to  sign) 
by  the  conditions  of  the  theorem.  Indeed, 


<Pl  —  On/i, 

(<Ph  <Pi)  =  ®h2(/i,  /i)  =  1, 


which  yields 


bn  = 


V(/i,/i)  and 


<P  i 


/i 


Let  the  functions  <Pk(k  <  n)  satisfying  conditions  1  )-3 )  be 
already  found.  Then  /„  can  be  represented  in  the  form 


fn  —  bnlpl  T  ’  ’  *  T  bntn—l<Pn—l  T  hn, 


where  (h„,  <pk)  =  0  for  k  <  n. 
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Obviously  (h„,  h„ )  >  0  (the  assumption  (h„,  hn)  =0  would 
have  resulted  in  a  contradiction  to  the  linear  independence  of 
the  system  (6)). 

Let  us  set  , 

hn 

Pn  =  V(hn,  K) 

Then  we  have 


((Pn,  <Pi )  0,  i  R, 

(“Pn,  <Pn)  =  1, 


fn  —  bn(pi  +  •  •  •  +  bnn<pn, 


bnn  —  \/  (bn,  hn)  ^  0, 


i.e.,  the  function  <p„(x)  satisfies  the  conditions  of  the  theorem. 

The  process  of  going  over  from  the  system  (6 )  to  the  system 
(7)  which  satisfies  conditions  l)-3)  is  called  the  orthogonalisation 
process. 

Obviously, 


fl,  fl,  )  =  M  ((pi,  <p2,  •  -  • ,  <pn,  •  •  • ) 

and,  therefore,  the  systems  (6 )  and  (7 )  are  either  both  complete 
or  both  incomplete. 

Thus,  in  any  problem  connected  with  the  approximation  of  the 
function  /  by  linear  combinations  of  the  functions  (6),  one  can 
replace  the  system  (6 )  by  an  orthonormal  system  (7 )  which  has 
been  obtained  from  (6)  by  an  orthogonalisation  process. 

As  we  have  already  said  earlier,  from  the  existence  of  a  count¬ 
able  everywhere  dense  set  in  L2  there  follows  the  existence  of  a 
countable  dense  system  of  linearly  independent  functions. 
Qrthogonalising  this  system  we  obtain  a  complete  dense  count¬ 
able  orthonormal  system. 

22.  Fourier  Series  on  Orthogonal  Systems. 

Riesz-Fischer  Theorem. 

Introducing  in  the  n-dimensional  Euclidean  space  R(n>  an 
orthogonal  normalised  basis  e\,  e2,  •••,  e„,  we  can  write  each 
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vector  x  £  R{n)  in  the  form 

n 

x  =  ckek]  (1) 

k=  1 

where 

ck  =  ( x ,  ek). 

The  content  of  this  section  is,  in  a  well  known  sense,  a  general¬ 
isation  to  the  case  of  an  infinite  dimensional  space  of  the  de¬ 
composition  (1 ).  Let 

<Pl,  <P2,  *  *  *,  <pn,  *  •  ■  (2) 

be  an  orthonormal  system,  and  /  €  L2. 

We  shall  pose  the  following  problem:  for  a  given  n  select  the 
coefficients  ak  (k  =  1,2,  •••,»)  in  such  a  way  that  the  distance, 
in  the  sense  of  the  metric  of  the  space  L2,  between/ and  the  sum 


Sn  =  22  ak<pk  (3) 

be  as  small  as  possible.  Set  ck  =  (f,  <pk).  Since  the  system  (2)  is 
orthonormal, 

11/  -  S„||2  =  (/  -  ak<pk,  f  -  X)  ak<fik) 

=  (/,  /)  -2 (  /,  X  <Xk<fik)  +  (X  <Xk<Pk,  X  aj<Pj) 

\  A^=l  /  \  k=l  7=1  / 


2  X  akck  +  X 


X  c*2  +  X)  —  c*)2-  (4) 


It  is  clear  that  this  expression  takes  on  its  minimum  when  the 
last  term  is  equal  to  zero,  i.e.,  for 


ak  =  ck,  k  =  1,  2,  •  •  •,  n. 

In  this  case 

II/-&II2  =  (/,/)-  x>2. 

fc=l 

Definition.  The  numbers 


(5) 

(6) 


°k  =  ( /,  <Pk) 
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are  called  the  Fourier  coefficients  of  the  function  /  £  L2  on  the 
orthogonal  system  (2)  and  the  series 


22  Ck-Vk 

k= 1 

(it  may  not  converge )  is  called  the  Fourier  series  of  the  function 
on  the  system  (2). 

We  have  shown  that  among  all  the  sums  (3 )  for  given  n  the 
one  which  differs  least  from/  (in  the  sense  of  the  L2  metric)  is 
the  partial  sum  of  the  Fourier  series  of  this  function.  Geometri¬ 
cally  this  result  can  be  explained  in  the  following  way.  The 
function 

» 

/  —  22  ak<Pk 
k- 1 

is  orthogonal  to  all  linear  combinations  of  the  form 

22  /3m-, 

i.e.,  it  is  orthogonal  to  the  subspace  generated  by  the  functions 
951,  952,  •  •  • ,  <pn,  in  that  and  only  in  that  case  when  condition  (5 )  is 
satisfied.  (Check  this! )  Thus,  the  result  just  obtained  is  a  general¬ 
isation  of  a  well  known  theorem  of  elementary  geometry:  the 
length  of  the  perpendicular  from  a  point  to  a  line  on  a  plane  is 
smaller  than  the  length  of  any  of  the  inclined  lines  drawn  from 
the  same  point. 

Since  always  ||  /  —  Sn  ||2  >0,  equation  (4)  implies 

i>2  <  ii/ii2- 

A;=l 

Here  n  is  arbitrary,  and  the  right  side  does  not  depend  on  n ;  hence 

oo 

the  series  22  c*2  converges  and 

k=l 

12  ci-2  <  II/II2-  (7) 

k*=l 

This  inequality  is  called  Bessel’s  inequality. 

Let  us  introduce  the  following  important  concept. 
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Definition.  An  orthonormal  system  (2)  is  called  closed  if  for 
every  function  /  £  L2  the  equation 


E  <*2  =  11/11  (8) 

it=l 

holds,  which  one  calls  Parseval’s  equation. 

One  can  see  from  (6)  that  the  fact  that  the  system  (2)  is 
closed  is  equivalent  to  the  fact  tjiat  the  partial  sums  of  the 
Fourier  series  of  each  function/  £  L2  converge  to/  in  the  sense 
of  the  L2  metric  (i.e.,  in  the  mean). 

The  concept  of  closure  of  an  orthonormal  system  is  closely 
connected  with  the  concept  of  completion  of  systems  of  functions 
introduced  in  §21. 

Theorem  1.  In  L2  space  every  complete  orthornomal  system  is 
closed,  and  conversely. 

Proof.  Let  {<pn(x)\  be  closed;  then,  whatever  the  function 
f  £  L2  may  be,  the  sequence  of  partial  sums  of  its  Fourier  series 
converges  to  it  in  the  mean.  This  means  that  the  linear  combina¬ 
tions  of  elements  of  the  system  { <^„( x ) )  are  everywhere  complete 
in  L2,  i.e.,  { <?„}  is  complete.  Conversely,  let  {<pK}  be  complete, 
i.e.,  one  can  approximate  any  function  /  £  L2  as  closely  as  de¬ 
sired,  in  the  sense  of  the  L2  metric,  by  a  linear  combination 

N 

E  ak<Pk 

k=l 

of  elements  {<Pk}‘,  then  the  partial  sum 

n 

E  Ck<Pk 

k= 1 

of  the  Fourier  series  for  /  gives,  generally  speaking,  an  even  closer 
approximation  of  the  function  /  and,  therefore,  the  series 


converges  to  /  in  the  mean,  and  Parseval’s  equality  holds. 
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In  §21  we  have  shown  the  existence  of  complete  orthonormal 
systems  in  L2.  In  as  much  as  the  concepts  of  closure  and  com¬ 
pletion  coincide  for  orthonormal  systems  of  functions  in  L2,  the 
existence  of  closed  orthogonal  systems  in  L2  does  not  require  a 
separate  proof,  and  the  examples  of  complete  orthonormal  sys¬ 
tems  given  in  §21  are  at  the  same  time  examples  of  closed  systems. 

From  the  Bessel  inequality  (7)  it  follows  that  in  order  that  the 
numbers  ch  c2,  •  •  •  be  Fourier  coefficients  of  some  function 
/  f  L2on  some  orthonormal  system  it  is  necessary  that  the  series 

oo 

E  Ck2 

k= 1 

converge.  It  turns  out  that  this  condition  is  not  only  necessary 
but  also  sufficient.  Indeed,  the  following  theorem  holds. 

Theorem  2.  (Riesz-Fischer).  Let  { <pn\  be  an  arbitrary  orthonormal 
system  in  L2  and  let  the  numbers 

Ci,  ci,  •••,  cn, 

be  such  that  the  series 

E  c*2  (9) 

fc=I 

converges.  Then  there  exists  a  function  f  £  L2  such  that 

Ck  =  (/,  Vk), 

and 

E  Ck2  =  (/,/). 

A;=l 

Proof.  Set 

n 

fn  =  E  Cklpk- 

A;=l 

Then 

n+p 

||/n+p  fn ||2  ||  Cn+l^n+1  “1“  *  *  *  "4"  Cn+p^n+pl)2  ^  ^  Cfc2* 

k—  n-f-1 

Since  the  series  (9)  converges,  this  implies,  on  the  strength  of  the 
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completeness  of  L2,  that  the  sequence  {/» }  converges  in  the  mean 
to  some  function/  €  L2.  Moreover, 

(/,  Vt)  =  (/»,  Pi)  +  (/-/.,  Vi),  (10) 

where  the  first  term  on  the  right  is  equal  to  c,-  for  n  >  i  and  the 
second  term  tends  to  zero  an  n  — >  »  since 

\(.f-fn,Vi)\  <  11/  -/.II* Ml- 

The  left  side  of  equation  (10)  does  not  depend  on  n;  therefore, 
taking  the  limit  as  n  — >  * ,  we  obtain 

(/,  Vi)  = 

Since,  by  the  definition  of  f(x), 

||/  -  /n||  — >  0  for  n  -»•  ®, 

the  equation 


£c*2  =  (/,/) 

is  satisfied  by  /.  The  theorem  is  proved. 

Let  us  in  conclusion  establish  the  following  useful  theorem. 

Theorem  3.  In  order  that  the  orthonormal  system  of  functions  (2) 
be  complete,  it  is  necessary  and  sufficient  that  there  does  not  exist  a 
function  in  L2  which  is  orthogonal  to  all  functions  of  the  system  (2) 
and  which  is  not  equivalent  to  't'(x)  =  0. 

Proof.  Let  the  system  (2)  be  complete  and  therefore  closed.  If 
/  €  L2  is  orthogonal  to  all  functions  of  the  system  (2),  than  all  its 
Fourier  coefficients  are  equal  to  zero.  Then,  from  Parseval’s  equa¬ 
tion  we  obtain 

(/,/)  =  2>2  =  o, 

i.e.,f(x)  is  equivalent  to  ^(x)  =  0. 

Conversely,  let  {/„}  be  not  complete,  i.e.,  let  there  exist  a 
function  g  £  L2  such  that 

oo 

(g,  g)>Z  Ck2, 


where  ck  =  ( g ,  <pk ) ; 
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then,  by  the  Riesz-Fischer  theorem,  there  exists  a  function 
/  £  L2,  such  that 

(  /,  <Pk)  =  e*,  ( /,  /)  =  £  c*,2. 

The  function  /  —  g  is  orthogonal  to  all  <p„  By  the  inequality 

(/>  /)  =  £  c* 2  <  (S',  fif), 

it  cannot  be  equivalent  to  'I'(x)  =  0.  The  theorem  is  proved. 


23.  The  Isomorphism  of  the  Spaces  Ls  and  /2 

From  the  Riesz-Fischer  theorem  immediately  follows  the 
following  important 

Theorem.  The  space  L2  is  isomorphic*  to  the  space  l2. 

Proof.  Let  us  select  in  L2  an  arbitrary  complete  orthonormal 
system  [<pn]  and  let  us  put  into  correspondence  with  every  func¬ 
tion/  £  L2a  sequence  ch  c2,  •  •  •,  c„,  •  •  •  of  its  Fourier  coefficients 
on  this  system.  Since  ck2  <  °o ,  (cu  c2,  •  •  • ,  c„,  •  •  • )  is  there¬ 
fore  some  element  of  l2.  Conversely,  by  virtue  of  the  Riesz- 
Fischer  theorem,  there  corresponds  to  every  element  (ci,  c2,  ■  •  •, 
cn ,  •  •  ■ )  of  l2  some  function  /  of  L2,  which  has  the  numbers  cu 
c2,  •  ••,  cn,  •  •  •  as  its  Fourier  coefficients.  The  correspondence  be¬ 
tween  the  elements  of  l2  and  L2  which  we  have  established  is  one 

*  Two  Euclidean  spaces  R  and  R'  are  said  to  be  isomorphic  if  one  can  establish  a 
one  to  one  correspondence  between  their  elements  in  such  a  way  that  if 

X  x',  y  <->  y' , 

then 

1)  x  +  y  *-*  x'  +  y' , 

2)  ax  <->  ax' , 

3)  (x,  y)  =  (x1,  y'). 

It  is  clear  that  two  isomorphic  Euclidean  spaces,  considered  just  as  metric  spaces,  are 
measurable. 
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to  one.  Moreover,  if 

/™  «_>  (Clo ))C|(i>,  c„®,  •••) 

and 


then 


'  (2) 


(ft®, 


C2 


(2) 


Cn 


(2) 


•), 


/  ®  +  / <2)  «-»  (ft®  +  ft®,  ft®  +  ft®,  •  •  •,  c.®  +  ft®,  •  •  • ) 

and 

fc/  ®  (Aft®,  Aft®,  •  •  •,  Aft®,  •  •  • ), 

i.e.,  the  sum  goes  over  into  a  sum,  and  the  product  by  a  number 
goes  over  into  the  product  of  a  corresponding  element  by  the 
same  number.  Finally,  the  Parseval  equation  yields  that 


(/(i),/(2>)  =  £  ft®ft®.  (i) 

n=l 

Indeed,  from  the  fact  that 

(/(1>,/(1))  =  £(Cn®)2,  (/®,/(2))  =  £  (ft®)2, 

and 

(/C1)  +/(2),/(1)  +/<2))  =  +2(/®,/®)  +(/®,/®) 

=  £  (ft(1)  +  c„(2))2  +  £  (ft®)2  +  2  £  ft®ft®  +  £  (cB®)2 

(1 )  follows.  Thus  the  correspondence  which  we  have  established 
between  the  elements  of  the  space  L2  and  U  is  indeed  an  iso¬ 
morphism.  The  theorem  is  proved. 

This  theorem  means  that  one  can  consider  l2  to  be  a  “coordinate 
notation”  of  the  space  L2.  It  allows  us  to  carry  over  facts  we  have 
established  earlier  for  l2  to  L2.  For  example,  we  have  shown  in 
Chapter  III  of  Volume  I  that  every  linear  functional  in  l2  has 
the  form 

<p(x)  =  (x,  y), 

where  y  is  some  element  of  l2,  uniquely  defined  by  the  functional 
<p.  From  this,  and  from  the  isomorphism  we  have  established  be- 


23.  THE  ISOMORPHISM  OF  SPACES  L2  AND  l2 


117 


tween  L2  and  U,  it  follows  that  every  linear  functional  in  l2  has 
the  form 


<p(f)  =  (f,g)=J f(x)g(x)  dn, 

where  g(x)  is  some  fixed  function  from  L2.  In  §24,  Volume  I  it 
was  established  that  l2  =  l2.  This,  and  the  theorem  just  obtained, 
yield  L2  =  L2. 

The  isomorphism  established  between  L2  and  l2  is  closely  re¬ 
lated  to  certain  problems  in  quantum  mechanics.  At  first,  quan¬ 
tum  mechanics  appeared  in  the  form  of  two  externally  different 
theories:  Heisenberg’s  “matrix  mechanics”  and  Schrodinger’s 
"wave  mechanics”.  Later,  Schrodinger  established  the  equiva¬ 
lence  of  the  two  theories.  From  the  purely  mathematical  point  of 
view  the  difference  between  the  two  theories  is  reduced  to  the 
fact  that  in  constructing  the  corresponding  mathematical  ap¬ 
paratus  Heisenberg  used  the  space  U  and  Schrodinger  the  space  L2. 


CHAPTER  V 


THE  ABSTRACT  HILBERT  SPACE.  INTEGRAL 
EQUATIONS  WITH  A  SYMMETRIC  KERNEL. 


In  the  preceding  chapter  we  have  established  that  the  spaces 
L2  (in  the  case  of  a  separable  measure)  and  U  are  isomorphic, 
i.e.,  in  essence  represent  two  different  realisations  of  one  and  the 
same  space.  This  space,  usually  called  Hilbert  space,  plays  an 
important  part  in  analysis  and  its  applications.  Often  it  is  useful 
not  to  tie  oneself  down  ahead  of  time  by  one  or  another  realisa¬ 
tion  of  this  space,  but  to  define  it  axiomatic  ally,  as  is  done  for 
example  in  linear  algebra  with  respect  to  the  n-dimensional 
Euclidean  space. 


24.  Abstract  Hilbert  Space 

Definition  1 .  The  set  H  of  elements  f,g,h,  •  •  ■  of  an  arbitrary 
nature  is  called  (abstract)  Hilbert  space,  if  the  following  con- 
ditipns  are  satisfied: 

I.  H  is  a  linear  space. 

II.  A  scalar  product  of  elements  is  defined  in  H,  i.e.,  to  each 
pair  of  elements  /  and  g  there  corresponds  a  number  (/,  g )  such 
that, 

1)  (/,  g)  =  (g,f), 

2)  («/,  g)  =  «(/,  g), 

3)  (/i  +/2,  g)  =  ( /i>  g)  +  (/2,  g), 

4)  (/,/)  >0  if  /  ^  0. 

In  other  words,  conditions  I  and  II  mean  that  H  is  a  Euclidean 
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space.  The  number  ||/||  =  V  (/,  /)  is  called  the  norm  of  the 
element  /. 

III.  The  space  H  is  complete  in  the  sense  of  the  p(f,  g)  = 
||/  —  g  ||  metric . 

IV.  The  space  H  is  infinite-dimensional,  i.e.,  for  any  natural 
n,  one  can  find  in  H,  n  linearly  independent  vectors. 

V.  The  space  H  is  separable*,  i.e.,  there  exists  in  it  a  count¬ 
able  everywhere  dense  set. 

It  is  easy  to  give  examples  of  spaces  which  satisfy  all  the 
enumerated  axioms.  Such  is  the  space  h  which  we  considered  in 
Chapter  II  of  Volume  I.  Indeed,  it  is  a  Euclidean  space,  which  is 
infinitely  dimensional  since,  for  example,  its  elements 

«i  =  (1,  0,  0,  •  •  •,  0,  •  •  •), 
e2  =  (0,  1,0,  •••,(),  •••), 
e3  =  (0,  0,  1,  •  •  •,  0,  •  •  •). 

are  linearly  independent;  the  fact  that  it  is  complete  and  sep¬ 
arable  was  proved  in  Chapter  II,  §9  and  §13  of  Volume  I.  The 
space  L2  of  functions  which  are  square  integrable  with  respect  to 
some  separable  measure  which  is  equivalent  to  U  also  satisfies  the 
same  axioms. 

The  following  statement  holds:  all  Hilbert  spaces  are  isomorphic. 

To  prove  this  fact  it  obviously  suffices  to  establish  that  every 
Hilbert  space  is  isomorphic  to  the  coordinate  space  l2.  This  last 
assertion  is  proved  in  essence  by  the  same  considerations  as  the 
isomorphism  of  the  spaces  L2  and  h,  namely: 

1 )  One  can  carry  over  to  Hilbert  space  without  any  changes 
those  definitions  of  orthogonality,  closure  and  completeness  which 
were  introduced  for  the  elements  of  the  space  L2  in  §21. 

2)  Selecting  in  the  Hilbert  space  H  a  countable  everywhere 
dense  set  and  applying  to  it  'the  process  of  orthogonalisation  de¬ 
scribed  (for  L2)  in  §21,  we  construct  in  H  a  complete  ortho¬ 
normal  system  of  elements,  i.e.,  the  system 

hi,  h2,  h„,  •  •  •,  (1) 

*  Condition  V  is  often  omitted,  i.e.,  one  considers  nonseparable  Hilbert  spaces. 
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which  satisfies  the  conditions 

fO  for  i  ^  k, 

a)  (hi,  hk)  =  | 

(l  for  i  =  k, 

b)  the  linear  combinations  of  elements  of  the  system  (1)  are 
everywhere  dense  in  H. 

3)  Let  F  be  an  arbitrary  element  in  H.  Set  ck  =  (J,hk).  Then, 
the  series  21  ckl  converges  and  Ck2  =  (/>  /)  for  any  complete 
orthonormal  system  { hk }  and  any  element  f  H. 

4)  Let,  again,  [hk\  be  some  complete  orthonormal  system  of 
elements  in  H.  Whatever  the  sequence  of  numbers 


Cl,  Ci,  •  •  • ,  c»,  •  •  • 

may  be  which  satisfy  the  condition 

21  ck2  < 

there  exists  in  H  an  element/  6  H  such  that 

Ck  =  (/,  hk), 


and 

I Zcf  =  (/,/). 

5)  From  our  statements  we  can  see  that  one  can  realise  an 
isomorphic  mapping  of  H  onto  h,  by  setting 

/  «-»  (ch  Cl,  •••,  C„,  •••), 


where 


and 


ck  =  (/,  hk), 


hi,  h2,  •  ",  hn,  ■  •  • 

is  an  arbitrary  complete  orthonormal  system  in  H. 

It  is  recommended  that  the  details  of  the  proof  be  completed 
by  the  reader  himself  using  the  methods  of  §§21-23. 
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25.  Subspaces.  Orthogonal  Complements.  Direct  Sum. 

In  correspondence  with  the  general  definition  of  §21,  Chapter 
III,  of  Volume  I  we  shall  call  a  set  L  of  elements  in  H  having  the 
property :  if  f,g£L,  then  af  +  pg  £  L  for  any  numbers  a  and  ft, 
a  linear  manifold.  A  closed  linear  manifold  is  called  a  subspace. 

Let  us  give  a  few  examples  of  subspaces  of  Hilbert  space. 

1 .  Let  h  be  an  arbitrary  element  in  H.  The  set  of  all  elements 
/  £  H  which  are  orthogonal  to  h  forms  a  subspace  in  H. 

2.  Let  H  be  realised  as  l>,  i.e.,  let  its  elements  be  sequences 
(xi,  x2,  •  •  • ,  x„,  •  •  • )  of  numbers  such  that  x^  <  «s .  The  ele¬ 
ments  which  satisfy  the  condition  Xi  =  x2  form  a  subspace. 

3.  Let  H  be  realised  as  the  space  L2  of  all  square  integrable 
functions  on  some  segment  [a,  6]  and  let  a  <  c  <6.  Let  us  de¬ 
note  the  set  of  all  functions  from  H  which  are  identically  equal  to 
zero  on  the  segment  [a,  c]  by  Hc.  Hc  is  a  subspace  of  the  space  H. 
If  Ci  <  c2,  then  HCl  D  HCi\  moreover,  Ha  =  H,  Hh  =  (0).  Thus 
we  obtain  a  continuum  of  subsets  of  H  which  are  contained  in 
each  other.  Each  of  these  subspaces  (of  course  with  the  exception 
of  Hb)  is  infinite-dimensional  and  isomorphic  to  the  whole  space  H. 

It  is  recommended  that  the  reader  check  that  the  sets  of 
vectors  given  in  examples  1-3  are  indeed  subspaces. 

Every  subspace  of  a  Hilbert  space  is  either  a  finite-dimen¬ 
sional  Euclidean  space  or  itself  a  Hilbert  space.  Indeed,  it  is 
obvious  that  axioms  I— III  are  fulfilled  for  each  such  subspace, 
and  the  following  lemma  shows  that  axiom  IV  also  holds. 

Lemma.  From  the  existence  of  a  countable  everywhere  dense  set  in  a 
metric  space  R  there  follows  the  existence  of  a  countable  everywhere 
dense  set  in  an  arbitrary  subspace  R'  of  R. 

Proof.  Let 

L,  •••,£„,  •  •  • 

be  a  countable  everywhere  dense  set  in  R.  Let,  furthermore, 

an  =  inf  p(£n,  *7), 

1 ieRf 
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and  i7„  e  R'  be  such  that 


p(£n,  tfn)  ^  2a„.  . 

For  any  t;  £  R',  one  can  find  an  n  such  that 

p(£»>  V)  < 

and,  therefore, 

.  .  '  2e 

P\£nj  Vn)  ^  g  > 

then  p( v,  v«)  <  s  be.,  the  countable  set 

vu  •  •  4,Vn, 

is  everywhere  dense  in  R'. 

The  fact  that  a  scalar  product  exists  in  Hilbert  space  and  the 
concept  of  orthogonality  makes  it  possible  to  supplement  essen¬ 
tially  the  results  given  in  the  first  part  of  the  course*  in  connec¬ 
tion  with  closed  linear  subspaces  of  arbitrary  Banach  spaces. 

Applying  the  orthogonality  process  to  an  arbitrary  countable 
everywhere  dense  sequence  of  elements  of  an  arbitrary  subspace 
of  Hilbert  space  we  obtain 

Theorem  1.  In  every  subspace  M  of  the  space  H  there  exists  an 
orthogonal  system  { )  the  linear  closure  of  which  coincides  with  M : 

M  =  M(<ph  932,  •  •  •,  <p„,  •  ••)■ 

Let  AT  be  a  subspace  of  the  Hilbert  space  H.  Let  us  denote  by 

M'  =  H  ©  M 

the  set  of  elements  g  £  H  which  are  orthogonal  to  all  /  (E  M,  and 
let  us  show  that  M'  is  also  a  subspace  of  the  space  H. 

Since  {gu  f)  =  (g2,  f)  =  0  implies  (<*&  +  a2g  2,  /)  =  0,  it  is  clear 
that  M'  is  linear.  To  prove  that  it  is  closed  assume  that  g„  be- 


Translater’s  Note:  The  reference  is  to  Functional  Analysis,  Volume  I,  [A], 
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longs  to  M'  and  converges  to  g.  Then,  for  any  f  £  M 
( 9,f  )  =  lim  (gn,  f  )  =  0, 

n->co 

and  therefore  g  also  belongs  to  M' . 

M'  is  called  the  orthogonal  complement  of  the  subspace  M. 
Theorem  1  obviously  implies 

Theorem  2.  If  M  is  a  closed  linear  subspace  of  the  space  H,  then 
any  element  f  6.  H  can  be  uniquely  represented  in  the  form  f  = 
h  +  h',  where  h  £  M,  h'  £  M' . 

Proof.  Let  us  first  show  that  such  a  decomposition  exists.  To  do 
this  let  us  find  a  complete  orthonormal  system  {ipn\  in  M  (so 
that  M  =  M  { <pn } )  and  set 


co 

h  ^  Cn<pnj  Cn  ~  * 

n=l 

Since  (by  Bessel’s  inequality),  the  series  of  c„2  converges,  the 
elements  h  exist  and  belong  to  M.  Set 

h'  =}  -  h. 


Obviously,  for  any  n 

{h' ,  9S„)  =  0, 

and,  since  any  element  of  M  can  be  represented  in  the  form 

=  y  j  Q'ntpnj 


we  have  for  f  €  M 


(h\  n  =  £  (h\  <pn)  =  o. 


Let  us  now  assume  that  there  exists,  aside  from  the  decomposi¬ 
tion/  =  h  +  h’  constructed' by  us,  another  decomposition: 

f  =  h  +  hi ',  hi  <E  M,  hf  e  M'. 

Then,  for  all  n, 

{hi,  <Pn)  —  (  /,  <Pn)  =  Cn, 
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which  implies  that 

h\  =  h,  hi  =  h'. 

Theorem  2  yields 

Corollary  1.  The  orthogonal  complement  to  the  orthogonal  comple¬ 
ment  of  a  closed  linear  subspace  M  coincides  with  M  itself. 

Thus,  we  can  speak  of  mutually  complementary  subspaces  of 
the  space  H.  If  M  and  M'  are  two  such  closed  linear  subspaces 
which  complement  one  another,  and  {pn},  Wm'\  are  complete 
orthogonal  systems  in  M  and  M' ,  respectively,  then  the  union  of 
the  systems  {<pn\  and  {<pm'\  gives  a  complete  orthogonal  system 
in  the  whole  space  H.  Therefore  we  have 

Corollary  2.  Every  orthonormal  system  {^„ )  can  be  extended  to  a 
system  which  is  complete  in  H. 

If  the  system  \<pn)  is  finite,  then  the  number  of  functions  it 
contains  is  the  dimension  of  the  space  M  and  at  the  same  time 
the  index  of  the  space  M'.  Thus  we  obtain 

Corollary  3.  The  orthogonal  complement  to  a  space  of  finite  di¬ 
mension  n  has  index  n,  and  conversely. 

If  every  vector  /  £  M  is  represented  in  the  form  f  =  h  +  h', 
h  £  M,  h'  £  M'  (. M '  is  the  orthogonal  complement  of  M ),  then 
one  says  that  H  is  the  direct  sum  of  mutually  orthogonal  sub¬ 
spaces  M  and  M',  and  one  writes 

H  =  M  ©  M'. 

It  is  clear  that  the  concept  of  a  direct  sum  can  be  immediately 
generalised  to  any  finite,  or  even  countable,  number  of  subspaces: 
one  says  that  H  is  the  direct  sum  of  its  subspaces 

H  —  Mi  ©  M2  ©  *  •  •  ©  M n  ©  *  *  * , 
if: 

1)  the  subspaces  are  pairwise  orthogonal  i.e.,  any  vector 
from  Mi  is  orthogonal  to  any  vector  from  Mk  for  i  k; 
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2)  every  element  /  G  M  can  be  represented  in  the  form 

f  =  hi  h2  hn  ,  hn  G  Mn.  (1) 

Moreover,  if  the  number  of  subspaces  Mn  is  infinite,  then 
22 1|  hn  ||2  is  a  convergent  series.  One  can  easily  see  that  there 
is  only  one  possible  such  representation  for  any  /  and  that 

ll/li2  =  El!  a.  II*. 

n 

Exactly  as  one  speaks  of  a  direct  sum  of  subspaces  so  one  can 
also  speak  of  a  direct  sum  of  a  finite  or  countable  number  of 
arbitrary  Hilbert  spaces.  Namely,  if  Hi  and  H2  are  two  Hilbert 
spaces,  their  direct  sum  H  is  defined  in  the  following  way:  the 
elements  of  the  space  H  are  all  possible  pairs  (hu  h2),  where 
hi  G  Hi,  h2  G  H2,  and  the  scalar  product  of  two  such  pairs  equals 

((hi,  hi),  (h/,  W))  =  (hi,  h/)  +  (h2,  hi'). 

The  space  H  obviously  contains  the  mutually  orthogonal  sub¬ 
spaces  which  consist  of  pairs  of  the  form  (hu  0)  and  (0,  h2),  re¬ 
spectively;  the  first  of  them  can  be  identified,  in  a  natural  way, 
with  the  space  Hi  and  the  second  with  the  space  H2. 

Analogously  one  defines  the  sum  of  an'  arbitrary  finite  number 
of  spaces.  The  sum  H  =  22  ©  Hn  of  a  countable  number  of  spaces 

n 

Hi,  Hi,  •••,  Hn,  ••■is  defined  in  the  following  way:  the  ele¬ 
ments  of  the  space  H  are  all  possible  sequences  of  the  form 

h  (hi,  hi,  • ,  hn,  •  •  • ) 

for  which 

E IIM2  <  °° •  . 

n' 

The  scalar  product  ( h ,  g)  of  the  elements  h  and  g  of  H  equals 

(kny  Qn)  • 
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26.  Linear  and  Bilinear  Functionals  in  Hilbert  Space 

The  fact  that  every  Hilbert  space  is  isomorphic  to  the  space  h 
allows  us  to  carry  over  a  series  of  facts,  established  in  Chapter  III 
of  Volume  I  for  l2,  to  the  abstract  Hilbert  space. 

Since  in  l2  every  linear  functional  has  the  form 

<p(x)  =  ( x ,  a), 

where  a  is  an  element  from  l2, 

any  linear  functional  F(h)  in  H  can  be  represented  in  the  form 

F(h)  =  (h,  g),  (1) 

where  g  depends  only  on  F. 

This  implies  that  the  definition  of  weak  convergence,  intro¬ 
duced  in  Chapter  III  of  Volume  I  for  an  arbitrary  linear  space, 
applied  to  the  space  H,  can  be  formulated  in  the  following  way. 

The  sequence  of  elements  hn  of  the  space  H  converges  weakly 
to  h  £  H,  if : 

1 )  the  norms  ||  hn  ||  are  bounded*, 

2 )  for  every  g  €  H 

(hn,  g)  -»  ( h ,  g). 

Any  orthonormal  sequence 

<Ph  Vi,  <Pn, 

in  FI  converges  weakly  to  zero,  since  for  any  h  £  M 
Cn  =  (K,  <Pn)  — >  0  for  U  •  >  , 

because 

Zv  <  (h,h)  <  00. 

Moreover,  such  a  sequence  does  not,  of  course,  converge  in  norm 
to  any  limit. 

In  particular,  applying  this  remark  to  the  case  when  H  is  the 
space  of  square  integrable  functions  on  the  segment  [a,  6]  of  the 
real  axis  with  the  usual  Lebesgue  measure,  we  obtain  the  follow¬ 
ing  interesting  fact. 


See  the  note  on  page  77  of  Volume  X. 
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Let 


<P2  (0,  •  *  ",  <Pn(t),  •  •  • 


be  some  orthonormal  system  of  functions  in  this  space,  and  let 


Then, 


m 


1  on  the  segment  [ t ,  tf]  C  [a,  fe], 
0  outside  [th  tf]. 


(f,<Pn) 


dt. 


Thus,  for  any  orthonormal  system  of  functions  <pn(t)  and  any 
h  and  tz  from  [a,  6], 


dt  — >  0. 


If  the  (pn{t)  are  bounded  in  the  set,  we  see  from  this  fact,  taking 
into  account  the  condition 


that  <pn(t)  with  a  large  index  inevitably  changes  its  sign  many 
times  (which  can  be  observed,  for  example,  in  the  case  of  a 
trigonometric  system). 

In  Chapter  III  of  Volume  I  we  introduced,  together  with  weak 
convergence  of  elements  of  a  linear  normalised  space,  also  the 
concept  of  weak  convergence  of  a  sequence  of  functionals.  In  as 
much  as  the  Hilbert  space  coincides  with  its  conjugate,  the  two 
concepts  are  identical  for  it;  therefore,  Theorem  1  of  §28  of 
Volume  I  gives  the  following  result  for  Hilbert  space: 

The  unit  sphere  in  H  is  weakly  compact,  i.e.,  me  can  select  from 
each  sequence  of  elements  £  H  for  which  ||  tpn  ||  =  la  weakly 
converging  subsequence. 

In  what  follows  we  shall  also  need  the  following 

Theorem  1 .  If  £„  converges  weakly  to  %  in  H,  then 

Hill  <  SUP  llinll- 
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Proof.  Whatever  the  complete  orthonormal  system  {<pk\  in  II 
may  be,  we  have 


ck  =  ({,  <Pk)  =  lim  (£„,  <pk)  =  lim  c„k, 

n-»oo  n-*co 


and 

k  k  oo 

52  Cm2  =  lim  51  °nm2  <  SUp  5)  Cnm', 

m= 1  n-»oo  m=l  m=  1 

therefore 


5^  cm2  <  sup  51  c„m2, 

m=l  n  m=l 

which  proves  the  theorem. 

Let  B(f,  g )  be  a  real  function  of  a  pair  of  vectors  from  H  which 
satisfies  the  following  condition:  for  fixed  g,  B(f,  g)  is  a  linear 
functional  in/,  and  for  fixed  /  it  is  a  linear  functional  in  g.  B(f,g) 
is  called  the  bilinear  functional.  The  bilinear  functional  B(f,  g) 
is  called  symmetric,  if 

B(f,  g)  =  B(g,f)  for  any  f,  g  £  H. 

From  the  theorem  concerning  the  general  form  of  a  linear  func¬ 
tional  in  H  it  follows  that  every  bilinear  functional  in  H  can  be 
written  in  the  form 


B{f,  g )  =  (L  g), 

where  $•  depends  on/.  It  is  easy  to  see  that  the  correspondence 

/^r 

is  a  continuous  linear  operator  in  H\  let  us  denote  it  by  A.  Thus, 

B(f,  g)  =  {A},  g).  (2) 

Analogously,  we  can  obtain  another  expression: 


B(f,  g)  =  (/,  A*g), 
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where  A  *  is  another  linear  operator,  which  one  calls  adjoint*  to  A. 
If  the  functional  B(f,  g )  is  symmetric,  then 

(Af,  g)  —  B(f,  g)  =B(g,f)  =  (Ag,  f  )  =  ( f,  Ag ), 

i.e., 

A  =  A*.  (3) 

A  linear  operator  satisfying  condition  (3 )  is  called  self-adjoint. 
Formula  (2)  establishes  a  one  to  one  correspondence  between 
the  bilinear  functionals  and  continuous  linear  operators  in  H. 
Moreover,  the  symmetric  bilinear  functionals  correspond  to  the 
self-adjoint  operators  and  vice  versa. 

Setting  /  =  <7  in  a  symmetric  bilinear  functional,  we  obtain  the 
so  called  quadratic  functional 

QU)  =  B{f,f ); 

by  (2) 

QU)  =  ( Af,f ), 

where  A  is  a  self-adjoint  linear  operator. 

Since  the  correspondence  between  the  symmetric  bilinear 
functionals  and  the  quadratic  functionals  is  one  to  one,t  the 
correspondence  between  quadratic  functionals  and  self-adjoint 
linear  operators  is  also  one  to  one. 

27.  Completely  Continuous  Self-Adjoint  Operators  in  H 

In  Chapter  IV,  Volume  I,  we  introduced  the  concept  of  a  com¬ 
pletely  continuous  linear  operator  which  acts  in  some  Banach 
space  E.  In  this  section  we  shall  supplement  the  facts  which  were 

*  In  Chapter  III  of  Volume  I,  considering  linear  operators  in  an  arbitrary  Banach 
Space  E,  we  defined  the  operator  A*,  adjoint  to  some  operator  A,  with  the  help  of 
equation 

(Ax,  ip)  =  ( X ,  A* ip) 

for  all  x  £  E,  £  E.  If  E  is  the  Hilbert  space,  E  =  E  and  the  definition  of  the 
operator  A *,  introduced  in  Chapter  III  of  Volume  I,  goes  over  into  the  definition 
formulated  here. 

t  QU)  =  and,  conversely: 

B(f,g)  =  \lQU  +  9)  -QU-an 

4 
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established  for  arbitrary  completely  continuous  operators,  limit¬ 
ing  ourselves  only  to  self-adjoint  completely  continuous  operators, 
acting  in  a  Hilbert  space. 

Let  us  remember  that  we  called  an  operator  A  completely  con¬ 
tinuous  if  it  transformed  every  bounded  set  into  a  compact  set. 
In  as  much  as  H  =  H,  i.e.,  H  is  conjugate  to  a  separable  space, 
all  the  bounded  sets  (and  only  those)  are  weakly  compact  in  it; 
therefore,  in  the  case  of  Hilbert  space,  the  definition  of  complete 
continuity  can  be  formulated  in  the  following  way: 

The  operator  A  acting  in  the  Hilbert  space  H,  is  called  com¬ 
pletely  continuous  if  it  transforms  every  weakly  compact  set  into 
a  compact  (in  norm)  set. 

In  Hilbert  space  this  is  equivalent  to  the  fact  that  the  operator 
A  transforms  every  weakly  converging  sequence  into  a  strongly 
converging  one. 

In  this  section  we  shall  establish  the  following  basic  theorem 
which  is  a  generalisation,  to  self-adjoint  completely  continuous 
operators,  of  a  theorem  stating  how  matrices  of  a  self-adjoint 
linear  transformation  in  n-dimensional  space  may  be  brought  to 
diagonal  form. 

Theorem  1 .  For  any  completely  continuous  self-adjoint  linear 
operator  A  in  Hilbert  space  H  there  exists  an  orthonormal  system 
{ tpn }  of  eigenvectors  corresponding  to  the  eigenvalues  { X„ }  such  that 
every  element  |  £  H  may  be  written  in  a  unique  way  as  £  =  'fFlcl#>k  +  £', 
where  £'  satisfies  the  condition  A  £'  =  0 ;  moreover,  k 

d£  =  2Z  h]£i,ipk, 

k 


and  lim  \n  =  0. 

n=oo 

For  the  proof  of  this  basic  theorem  we  shall  need  the  following 
auxiliary  assertion: 

Lemma  1 .  If  [  £„ )  converges  weakly  to  £  and  the  self-adjoint  linear 
operator  A  is  completely  continuous,  then 
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Proof. 

I  (Alin,  £n)  —  (Aj,  £)  |  <  |  (A£n,  Jn)  —  (Ajn,  J)  |  +  |  (A£„,  J)  ~  (Aj,  £)  |  . 
But 

I  [Ain,  in)  -(AJ.,J)  I  =  |(J»,  A  (in  ~  J))|  <  IU»IH|A  ({.  -  J)  ||, 
and 

I  (A£„,  J)  —  (Aj,  J)  |  =  |  (J,  A(J„  —  J))  |  <  |klM|A(&.  —  f) ||7 
andsince  the  quantities  ||  £„  ||  are  bounded  and  ||  A  —  j)  ||  — »  0, 
|(AJ„,  J„)  -  (Aj,  J)|  -»0, 
which  was  to  be  proved. 

Lemma  2.  If  the  functional 

|Q(J)I  =  l(AJ,  J)|, 

where  A  is  a  bounded  self-adjoint  linear  operator,  attains  its  max¬ 
imum  on  the  unit  sphere  at  the  point  J0,  then 

(Jo,  v)  =  0 

implies  that 

(A v)  =  (Jo,  A-q)  =  0. 

Proof.  Obviously  ||  Jo  ||  =  1.  Let  us  set 

t  —  Jo  +  aV 
-  VI  +  a2||„|!2  ’ 

where  a  is  an  arbitrary  number.  From  ||  £0  ||  =  1  it  follows  that 

lull  <  L 

Since 

Q( J)  =  ,  .  2 1 1  i|2  W(Jo)  +  2a(A£0,  n)  +  a2Q(»;)]  , 

1  +  a2||i?||2 

we  have  for  small  a 

Q( J)  =  Q(Jo)  +2a(dJ0,  v)  +  0(a2). 
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The  last  equation  makes  it  clear  that  if  (A£0,  v)  ^  0,  then  a 
can  be  selected  in  such  a  way  that  |  Q(£)  |  >  |  Q(£0)  |,  which 
contradicts  the  condition  of  the  theorem. 

Lemma  2  immediately  implies  that  if  |  Q(  £ )  |  attains  its  max¬ 
imum  for  £  =  £0,  then  £0  is  an  eigenvector  of  the  operator  A. 

Proof  of  the  Theorem.  We  shall  construct  the  elements  <pk 
by  induction,  in  order  of  decreasing  absolute  values  of  the  cor¬ 
responding  eigenvalues : 

N  >  |x2|  >  •••  >  |xn|  >  •••. 

To  construct  the  element  <p%  let  us  consider  the  expression 
Q(  £)  =  |  (A  £,  £ )  |  and  let  us  prove  that  it  attains  a  maximum  on 
the  unit  sphere.  Let 

S  =  sup  |  (A£,  £)  | 
lUll^i 

and  £i,  £2,  •  •  •  be  a  sequence  for  which  ||  £„  ||  <1  and 
|  ( A£„,  £„)  |  — >  S  for  n  — >  » . 

Since  the  unit  sphere  is  weakly  compact  in  H,  one  can  select  from 
{ £n }  a  subsequence  which  converges  weakly  to  some  element  r;. 
Here  moreover,  by  Theorem  1  of  §26,  1 1  ??  1 1  <1  and,  by  Lemma  1, 


!  (Ai),  ri)  |  =  S. 

Lel£  us  take  the  element  -q  for  <pi.  It  is  clear  that  1 1  q 
equal  to  unity.  Hence 


which  yields 


Aip\  —  \vpi, 


is  exactly 


M 


(dyi,  y>i)  | 

(<»ii  <Pi) 


|  (Aipi,  .pi)  |  -  S. 


Assume  now  that  the  eigenvectors 


VU  <P2,  •  *  *7  <Pn, 

corresponding  to  the  eigenvalues 

Xi,  X2,  — ,  Xn, 
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are  already  constructed.  Let  us  consider  the  functional 

\(M,  £)| 

on  the  set  of  elements  which  belong  to 


Mn'  =  H  ©  M(<pi,  Ifii,  •••,  <pn) 

(i.e.,  orthogonal  to  <pi,  <pi,  •  •  •,  <p„)  and  which  satisfy  the  condition 
||  £  ||  <  1.  Mr!  is  a  subspace  which  is  invariant  with  respect  to  A 
(since  M(<pi,  <pz,  •••,  <pn)  is  invariant  and  A  is  self-adjoint).  Ap¬ 
plying  the  above  considerations  we  see  that  one  can  find  a  vector 
in  Mn'  which  is  an  eigenvector  of  the  operator  A ;  let  us  call  it 

(Pn+ 1. 

Two  cases  are  possible:  1)  After  a  finite  number  of  steps  we 
obtain  a  subspace  M in  which  (A  £,  £ )  =  0 ;  2 )  {A  £,  £ )  ^  0  on 
MV  for  all  n. 

In  the  first  case,  we  see  from  Lemma  2  that  the  operator  A 
transforms  M„0'  into  zero,  i.e.,  M„0'  consists  wholly  of  eigenvectors 
which  correspond  to  X  =  0.  The  system  of  vectors  { «sB }  which  we 
have  constructed  consists  of  a  finite  number  of  elements. 

In  the  second  case  we  obtain  the  sequence  {p„)  of  eigenvectors 
for  each  of  which  X„  ^  0.  We  shall  show  that  Xn  — >  0.  The  se¬ 
quence  \<pn\  (as  every  orthonormal  sequence)  converges  weakly 
to  zero,  therefore  the  elements  Aipn  =  Xr.^„  must  converge  to  zero 
with  respect  to  the  norm;  hence  X„  =  ||  Aipn  ||  — >  0. 

Let 

M'  =  H  ©  M{ipn]  =  0  tf.V  0. 


If  £  €  M'  and  £  ^  0,  then 


i.e., 


(d£,  £)  <  X„||£||2  for  all  n, 

(A£,  £)  =  0, 


from  which,  by  Lemma  2  (for  max  |  (A  £,  £ )  |  =  0 )  applied  to 

llfllsl 

M',  we  see  that  A£  =  0,  i.e.,  the  operator  A  transforms  the  sub¬ 
space  M'  into  zero. 


134  V.  THE  ABSTRACT  HILBERT  SPACE.  INTEGRAL  EQUATIONS 


From  the  construction  of  the  system  \<pn)  it  is  clear  that  every 
vector  can  be  represented  in  the  form 

£  =  5Z  Ck<Pk  +  £',  where  A£'  =  0, 

k 

which  implies  that 

2  ^ tflktpk ■ 
k 

28.  Linear  Equations  with  Completely  Continuous  Operators 

Let  us  consider  the  equation 

£  =  cd£  +  i),  (1) 

where  A  is  a  completely  continuous  self-adjoint  operator,  the 
element  q  £  H  is  given,  and  £  £  H  is  sought  for. 

Let 

<pl,  <P2,  •  •  • ,  <Pn, 

be  eigenvectors  of  the  operator  A  corresponding  to  the  eigen¬ 
values  which  are  different  from  zero.  Then  q  can  be  written  in 
the  form 

V  =  X)  +  J?',  (2) 

n 

where  Aq'  =  0.  Let  us  look  for  the  solution  of  equation  (1 )  in 
the  form 

£  =  S  +  £7,  (3) 

n 

where  =  0.  Inserting  (2)  and  (3)  into  (1),  we  obtain 
y '  (1  Xnc)  (pn  -f*  £  v  dnpn  “1“  V  • 

n  n 

This  equation  is  satisfied  if  and  only  if 


£'  =  v', 

( 1  Xnc)  Un, 
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Xn  =  —  for  Xn  5^  -  (4) 

1  —  X„e  c 

1 

a„  =  0  for  X„  =  -  . 

c 

The  last  equality  gives  the  necessary  and  sufficient  condition  for 
equation  (1)  to  be  solvable  and  (4)  determines  the  solution.  The 
values  xn,  corresponding  to  the  n  for  which  X„  =  1/c,  remain 
arbitrary. 


29.  Integral  Equations  with  a  Symmetric  Kernel 


The  results  described  in  the  preceding  section  can  be  applied  to 
integral  equations  with  symmetric  kernels,  i.e.,  to  equations  of 
the  form 


f(t)  =  <p(t)  +  f  K(t,  s)f(s)  ds, 

•'a 

where  K(t,  s)  satisfies  the  conditions 

1)  K(t,  s)  =  K(s,  t), 

2)  J  J  K2(t,  s)  dt  ds  <  oo. 


(1) 


The  application  of  the  results  of  §28  to  equations  of  the  form 
(1 )  is  based  on  the  following  theorem. 


Theorem.  Let  R  be  some  space  with  a  measure  n  prescribed  on  it. 
If  the  function  K(t,  s )  defined  on  R2  =  R  X  R  satisfies  the  condi¬ 
tions 


K(t,  s)  =  K(s ,  t ) 


(2) 
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and 


f  K2{t,  s)  dn2  <  oo  ,  m2  =  M  X  m  , 

•'e2 


i/ie  operator 


9  = 


given  in  the  space  L2{R,  n)  by  the  formula 


(3) 


g(t)  =  f  K(t,  s)f(s)  ds, 

JR 

is  completely  continuous  and  self-adjoint. 

Proof.  Let  us  denote  the  space  L2(ft,  n )  by  L2.  Let  {y l/n(t) }  be  a 
complete  orthonormal  system  in  L2.  The  set  of  all  possible  prod¬ 
ucts  ypn{t)  fm(s)  is  a  complete  system  of  functions  in  ft2,  and 

m  n 

where 


Omn  Omm 


(by  virtue  of  (2 ) ) ,  and 

Z  Z  amn2  =  f  K2(t,  s)  d»2  <  oo. 

m  n  Jr2 


Set 


then 


/(S)  =  Z  bnin(s), 


g{x)  =  Af  =  Z(Z  Omnbn)  ^pm  (*^)  Z  cmipm(x). 


Moreover, 


Cm* 


=  (z  Umn&n)  <  Z  (W**  Z  ^  =  !!/! 

\  n= 1  /  n=l  n=l 


r-a„2, 


where 
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Since  the  series 


&mn 


converges,  one  can  find,  for  any  e  >  0,  an  m0  such  that 


dm?  ^  Ej 

m= m0+l 


i 

|  g(x)  -  J^,Cm^m(x) 


£  Cm2<£  ||/||2. 

77i— m0+l 


Now  let  { fw\  converge  weakly  to  /.  Then  the  set  cm(k)  con¬ 
verges  to  cm  for  every  m,  hence  the  sum 


z 


771=1 


Cm  ^m(^r) 


converges  in  the  mean  for  any  fixed  m0  to  the  sum 

mp 

^  '  Cm  ^m(^r)  . 

m-1 

By  inequahty  (4)  and  the  boundedness  of  the  norm  \\f(h)\\  this 
implies  that  \g(k)(x) }  (where  g{k)  =  Af(-k))  converges  in  the  mean 
to  g(x),  which  proves  the  complete  continuity  of  the  operator  A. 
Furthermore,  from  condition  (1)  and  Fubini’s  theorem  it  follows 
that 

( Af ,  g )  =  jR{jR  K(s>  0/(0  diM-gis)^  dMs 


=  J  /(<)(/  K(s>  0 g(s)  dn}jdnt  =  ( f ,  Ag), 

i.e.,  the  operator  A  is  self-adjoint.  The  theorem  is  proved.  In  this 
way  the  problem  of  solving  an  integral  equation  with  a  kernel 
satisfying  conditions  (2 )  and  (3 )  is  reduced  to  finding  eigenfunc¬ 
tions  and  eigenvalues  of  the  corresponding  integral  operator.  The 
practical  solution  of  this  last  problem  usually  requires  the  use  of 
some  limiting  methods,  the  description  of  which  is  beyond  the 
scope  of  this  book. 


ADDITIONS  AND  CORRECTIONS  TO 
VOLUME  I* 


After  our  first  book  appeared,  a  certain  number  of  misprints, 
some  of  which  were  fundamental,  and  various  inadequacies  of 
presentation  were  discovered.  Below  we  list  corrections  of  the 
errors  found. 

(1)  Page  28,  line  23  from  above.  Instead  of  Ga  read  Gjx). 

(2)  Page  46,  lines  23-24  from  above.  Instead  of  “the  method 
of  successive  approximations  is  not  applicable”  it  should  read 
“the  method  of  successive  approximations  is,  generally  speaking, 
not  applicable”. 

(3)  Page  49,  after  line  22  insert,  “where  for  fo(x)  one  can  take 
any  continuous  function”. 

(4)  Page  51,  in  line  2  from  the  top  for  M  write  M2,  in  line  3 
from  the  top  in  place  of  M  write  Mn  (twice),  and  in  line  5  from 
the  top  for  \n  write  \nMn. 

(5)  Page  55,  line  15  from  above.  For  e/5  write  “less  than  e/5. 

(6)  Page  56,  fine  13  from  above.  Instead  of  “of  the  region  G” 
it  should  read  “of  the  bounded  region  G”. 

(7)  Page  57,  line  28  from  above.  For  <p(x)  write  <pw(x). 

(8)  Page  61,  before  Theorem  7,  one  should  insert:  “We  shall 
call  the  mapping  y  =  f(x )  uniformly  continuous  if,  for  any  e  >  0, 
one  can  find  a  S  >  0  such  that  p(/(xi),  f  (x2) )  <  e  for  all  xh  x2 
for  which  p(x i,  x2)  <  S.  The  following  assertion  holds:  Every 
continuous  mapping  of  a  compactum  into  a  compadum  is  uniformly 
convergent.  It  is  proved  by  the  same  method  as  the  uniform  con¬ 
tinuity  of  a  function  which  is  continuous  on  a  segment.” 


*  The  additions  and  corrections  listed  here  arc  in  terms  of  the  page  numbering  of 
the  English  translation  of  Volume  I,  published  by  Graylock,  Press. 
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(9)  Page  61.  After  line  17  from  above,  instead  of  the  word 
“Proof”  it  should  read: 

“Let  us  first  prove  the  necessity.  If  D  is  compact,  then  there 
exists  in  D  a  finite  s/3  net/i,/2,  ■  •  • ,  />■■  Since  each  of  the  mappings 
fi  is  continuous,  it  is  uniformly  continuous,  therefore  one  can 
find  5  >  0  such  that 

p(/»(*i), /«(%))<  ^  ,  *  =  1,  2,  ...,n, 


only  if 


p(x i,  x2)  <  5. 


For  every  mapping/  6  D  one  can  find  an  /,  for  which 


P(f,  U)  <  g£  • 


Then, 

p(/Oi),/(zi))  <  p(/0 i),/((*i))  +  p(/<(xi), /,•(%)) 

+  p(fi(xi)>  /(*»))  <3_^3_^3  =  £ 

only  if  p(xi,  x2)  <  5i,  and  this  means  that  the  mappings  /  €  D 
are  uniformly  continuous.  Let  us  now  prove  the  sufficiency.” 

(10)  Page  72,  line  3  from  the  bottom.  Instead  of  “max”  read 

“sup”.  l-"-“ 

l<»^oo 

(11)  Page  77,  line  9  from  above.  Instead  of  “continuous”  it 
should  read  “continuous  at  the  point  x".  In  line  11  from  above 
instead  of  “|/(xi)  —  f(x2)  |”  read  “|/(x)  —  /(x0)  |”  and  in  line 
13  instead  of  “||  x1  —  x2 1|'’  read  “||  x  —  x0 1|.” 

(12)  Page  80,  line  4  of  Theorem  3.  Instead  of  “x0  (£  L/’  read, 
“ x0  is  a  fixed  element,  not  belonging  to  Lt”. 

(13)  Page  84,  line  12  from  above.  Instead  of  “sup„x„”  read 

“SUP„  |  Xn  |”. 
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(14)  Page  92,  lines  26-27  from  above.  The  statement  that 
“these  functionals  are  everywhere  dense  in  Cla,  is  not  true. 
Instead  of  “satisfies  the  conditions  of  Theorem  1,  i.e.,  linear 
combinations  of  these  functionals  are  everywhere  dense  in 
C[ai»]”  read:  “has  the  property  that  if  the  sequence  {«„(<)}  is 
bounded  and  <p{xn)  — >  <p(x)  for  all  ^  €  A,  then  {£„(<))  converges 
weakly  to  x(t)". 

(15)  Page  94.  The  metric  introduced  here  leads  to  weak  con¬ 
vergence  of  functionals  in  every  bounded  subset  of  the  space  R 
(but  not  in  all  of  R).  In  lines  32-33,  after  the  words  “so  that”, 
insert,  “in  every  bounded  subset  of  the  space  R”. 

(16)  The  proof  of  Theorem  5  given  here  contains  a  mistake. 
It  should  be  replaced  by  the  following: 

“Proof.  1)  Let  us  note  first  of  all  that  every  eigenvalue  of  a  con¬ 
tinuous  operator  which  is  different  from  zero  has  finite  multi¬ 
plicity.  Indeed,  the  set  Ex  of  all  eigenvectors  which  correspond  to 
a  given  eigenvalue  x  is  a  linear  space,  the  dimension  of  which  is 
equal  to  the  multiplicity  of  this  eigenvalue.  If,  for  some  X  ^  0, 
this  subspace  were  infinite,  the  operator  A  would  not  be  com¬ 
pletely  continuous  in  E*,  and,  therefore,  also  in  the  whole  space. 

2)  Now  to  complete  the  proof  of  the  theorem  it  remains  to 
show  that  whatever  the  sequence  { X„ }  of  pairwise  different  eigen¬ 
values  of  a  completely  continuous  operator  A,  X„  — >  0  for  n  — >  w. 
Let  xn  be  an  eigenvector  of  the  operator  A,  corresponding  to  the 
eigenvalue  X„.  The  vectors  x„  are  linearly  independent.  Let 
E„(n  =  1,2,  •  •  • )  be  a  subspace  consisting  of  all  elements  of  the 

n 

form  y  =  £  For  every  y  £  En  we  have 
1=1 


1  .  y->  v-'  a /.  k»\ 

—  —  Ay  =  2^  a&t  -  -7—  Xi  =  [1  -  —  torn 

An  i=*l  »=1  An  \  An/ 


which  shows  that  y  —  Ay /Xn  £  En_  1. 
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Let  us  select  a  sequence  of  elements  \yn)  in  such  a  way  that 

t/n  £  En,  llv.ll  =  1,  pit/nj  En— l)  >  —  . 


(The  possibility  of  selecting  such  a  sequence  is  shown  on  page 
118). 

Let  us  now  assume  that  the  sequence  l/x„  is  bounded.  Then 
the  set  { A(y„/\n ) )  must  be  compact,  but  this  is  impossible  since, 
for  p  >  q, 


iA*’  +  A©. 


because 

Vp  ~  ~  AVp  +  d(^)  €  Ep. i. 


The  contradiction  we  have  obtained  proves  our  assertion.” 

(17)  Page  119,  line  12  from  above.  The  statement  that  G0  is  a 
Subspace  is  correct  but  is  not  evident.  Therefore  one  should  change 
the  sentence:  “Let  G0  be  the  subspace  consisting  of  all  the  ele¬ 
ments  of  the  form  x  —  Ax."  into:  “Let  Go  be  a  linear  manifold 
consisting  of  all  elements  of  the  form  x  —  Ax.  Let  us  show  that 
Go  is  closed.  Let  f  be  a  one  to  one  mapping  of  the  factor  space 
E/N  (where  N  is  the  subspace  of  elements  satisfying  the  condi¬ 
tion  x  —  Ax  =  0)  on  G0.  We  must  show  that  the  inverse  mapping 
T~l  is  continuous.  It  suffices  to  show  that  it  is  continuous  at  the 
point  y  =  0.  Assume  that  this  is  not  the  case;  then  there  exists  a 
sequence  yn  — >  0  such  that,  for  £„  =  f~lyn,  the  inequality  ||  £„  ||  > 
p  >  0  is  satisfied.  Setting  =  £„/||  \\,zn  =  yn/\\  £n||,  we  obtain 

the  sequence  {»/„}  which  satisfies  the  conditions  ||  Vn  ||  =  1,  = 

zn  —>  0.  Selecting  in  each  class  yn  a  representative  xn,  in  such  a  way 
that  1 1  xn  1 1  <  2,  we  obtain  a  bounded  sequence  which  satisfies 
the  condition:  zn  =  Tx„  =  xn  —  Axn  — >  0.  But,  since  the  operator 
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A  is  completely  continuous,  {Axn }  contains  a  continuous  funda¬ 
mental  subsequence  [Axnk\]  the  sequence  xp  =  zp  +  Axp  is  also 
fundamental  and,  therefore,  it  converges  to  some  element  x0. 
Hence  zp  =  Txp  — >  Tx 0,  which  implies  Tx 0  =  0.  i.e. ,  xa  €  N.  But 
then,  ||  tjp  ||  <  ||  xp  —  x0 1|  — >  0,  which  contradicts  the  condition 
||  ijp  ||  =  1.  The  contradiction  obtained  shows  that  the  mapping 
T -1  is  continuous,  and,  therefore  also  that  Go  is  closed.  Hence  G 0 
is  a  subspace.” 
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